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Systems engineer Qian Xuesen had fans in high places. 
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1200 COMPLEX BEHAVIOR AROSE AT 
DAWN OF HUMANS 

Advanced stone tools, pigment, 

and extensive networks emerged as 
environment changed 

By A. Gibbons 

> R. POTTS ETAL. 10.1126/science.aao2200; 


A. S. BROOKS ET AL. 10.1126/science.aao2646; 


A. L. DEINO ET AL. 10.1126/science.aao2216 


1201 NIGERIA HIT BY UNPRECEDENTED 
LASSA FEVER OUTBREAK 

As efforts to contain it mount, 
researchers are racing to find out 
what is driving this year’s surge 

in cases and deaths 

By L. Roberts 


1202 VIBRATIONS USED TO TALK TO 
QUANTUM CIRCUITS 

Sound waves could supplant 
microwaves in controlling quantum 
computers By A. Cho 
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1204 MAX PLANCK SOCIETY, 

AT A CROSSROADS, SEEKS NEW LEADERS 
A wave of retirements offers a chance to 
recruit female leaders and open up new 
research avenues 

By K. Kupferschmidt 


1205 CONCERN AS HIV PREVENTION 
STRATEGY LANGUISHES 

Few countries promote prophylactic 
drug regimen, and many people shun it 
By J. Cohen 
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1206 MASTER PLANNER 


China’s revered rocket scientist, Qian 
Xuesen, set in motion a system for 
engineers to control Chinese society 
By M. Hvistendahl 


INSIGHTS 


PERSPECTIVES 


1210 RHYTHMS: THE DARK SIDE MEETS 
THE LIGHT 

Researchers examine the 24-hour 
biological clock in daytime-active 
baboons By A. Millius and H. Ueda 

> RESEARCH ARTICLE P. 1232 
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Early stirrings of 
modern behavior 


1212 CIRCADIAN ORGANIZATION 

OF THE GENOME 

The clock protein Rev-erba regulates 
genome folding to establish circadian 
gene repression 

By C. Diettrich Mallet de Lima 

and A. Gondor 

> REPORT P. 1274 


1213 ALANDSCAPE OF DISGUST 
Parasite avoidance behavior 

affects ecology and evolution in 
ways similar to predator avoidance 
By S. B. Weinstein et al. 

> PODCAST 


1214 LOGIC IN BABIES 
12-month-olds spontaneously 
reason using process of elimination 
By J. Halberda 

> REPORT P. 1263 


1216 RANDOM COPOLYMERS THAT 
PROTECT PROTEINS 

Synthetic polymers are designed 

to stabilize proteins in polar 

and nonpolar solvents 

By A, Alexander-Katz and R. C. Van Lehn 
> REPORT P. 1239 


1217 BEYOND PARP—POL® 

AS AN ANTICANCER TARGET 
Targeting cancers dependent on 
DNA polymerase 6 has considerable 
clinical potential By G. S. Higgins 
and S. J. Boulton 
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1219 SOCIETAL INEQUALITIES AMPLIFY 
GENDER GAPS IN MATH 

Egalitarian countries cultivate 
high-performing girls 

By T. Breda et al. 


BOOKS ETAL. 
1221 WOMEN’S WORK 


Stepping into scientific roles 
left vacant during World War I 
brought gains for women in and 
out of the laboratory 

By L. M. Puaca 


1222 LIFE IN TRIPLICATE 

A tale of triplets separated at birth 
raises red flags and questions about 
the role of nature and nurture 

By G. Kardon 
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LETTERS 


1224 ARRIVAL ROUTES OF FIRST 
AMERICANS UNCERTAIN 
By B. A. Potter et al. 


1225 RESPONSE 
By T. J. Braje et al. 


1225 POST-PUBLICATION PEER REVIEW: 
A CRUCIAL TOOL 
By G. I Peterson 


1226 TECHNICAL COMMENT ABSTRACTS 
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1227 From Science and other journals 


RESEARCH ARTICLES 


TOPOLOGICAL PHOTONICS 

1230 Topological insulator laser: 
Theory G. Harari et al. 
RESEARCH ARTICLE SUMMARY; FOR FULL 
TEXT: dx.doi.org/10.1126/science.aar4003 


1231 Topological insulator laser: 
Experiments M. A. Bandres et al. 
RESEARCH ARTICLE SUMMARY; FOR FULL 
TEXT: dx.doi.org/10.1126/science.aar4005 


1232 CIRCADIAN RHYTHMS 

Diurnal transcriptome atlas of a primate 
across major neural and peripheral 
tissues L. S. Mure et al. 

RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
dx.doi.org/10.1126/science.aao0318 
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1233 HUMAN GENOMICS 

Phenotype risk scores identify patients 
with unrecognized Mendelian disease 
patterns L. Bastarache et al. 
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1239 POLYMERS 

Random heteropolymers preserve 
protein function in foreign environments 
B. Panganiban et al. 

> PERSPECTIVE P. 1216 


1243 SURFACE CHEMISTRY 

Real-time imaging of adatom-promoted 
graphene growth on nickel 

L. L. Patera et al. 


1247 ENZYMOLOGY 

Organometallic and radical 
intermediates reveal mechanism of 
diphthamide biosynthesis M. Dong et al. 


1251 INDUCED SEISMICITY 

Oklahoma’s induced seismicity strongly 
linked to wastewater injection depth 

T. Hincks et al. 


1255 FISHERIES 

Protecting marine mammals, turtles, 
and birds by rebuilding global fisheries 
M. G. Burgess et al. 


1259 MALARIA 

GDV1 induces sexual commitment of 
malaria parasites by antagonizing HP1- 
dependent gene silencing 

M. Filarsky et al. 


1263 COGNITIVE SCIENCE 
Precursors of logical reasoning 
in preverbal human infants 

N. Cesana-Arlotti et al. 

> PERSPECTIVE P. 1214 


1266 NITROGEN CYCLE 

Factoring stream turbulence into global 
assessments of nitrogen pollution 

S. B. Grant et al. 


1269 NEURODEVELOPMENT 
Astrocyte-derived interleukin-33 
promotes microglial synapse 
engulfment and neural circuit 
development J. D. Vainchtein et al. 
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Plastics helping proteins 


1274 CIRCADIAN RHYTHMS 

Rev-erba dynamically modulates 
chromatin looping to control circadian 
gene transcription Y. H. Kim et al. 

> PERSPECTIVE P. 1212 


1277 AGING 

Lysosome activation clears aggregates 
and enhances quiescent neural stem cell 
activation during aging 

D.S. Leeman et al. 


1283 SINGLE-CELL ANALYSIS 

Mutation dynamics and fitness effects 
followed in single cells L. Robert et al. 
> VIDEO 


DEPARTMENTS 


1195 EDITORIAL 
Good gun policy needs research 
By Alan I. Leshner and Victor J. Dzau 


1294 WORKING LIFE 
Instagram won't solve inequality 
By Meghan Wright 


Illustration of a non- 
magnetic topological 
insulator laser: an 
array of microring 
resonators (circles) 
coupled by waveguides 
(ellipses) in a topo- 
logical fashion. The red 
perimeter depicts the 
lasing topological edge mode, which exits 
through one of the two outputs (solid red 
triangle). Implementation of this design in 
an all-dielectric semiconductor enhances 
lasing performance and makes many diode 
lasers operate together as a single highly 
coherent laser. See pages 1230 and 1231. 
Illustration: Valerie Altounian/Science 
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EDITORIAL 


Good gun policy needs research 


he tragic shooting at a school in Parkland, 
Florida, last month triggered another round of 
proposals from local, state, and federal policy- 
makers about controlling firearm-related vio- 
lence without violating broad interpretations of 
the rights to keep and bear arms provided by the 
U.S. Constitution. Unfortunately, there is only 
very sparse scientific evidence that can help figure out 
which policies will be effective. Earlier this month, the 
RAND Corporation released a comprehensive analysis 
on gun policy in the United 
States,* and among its con- 
clusions is that too few poli- 
cies and outcomes have been 
the subject of rigorous sci- 
entific investigation. Even 
the seemingly popular view 
that violent crime would be 
reduced by laws prohibiting 
the purchase or possession 
of guns by individuals with 
mental illness was deemed to 
have only moderate support- 
ing evidence. If the nation 
is serious about getting fire- 
arm-related violence under 
control, it must rise above 
its aversion to providing fi- 
nancial support for firearm- 
related research, and the sci- 
entific community will have 
to expeditiously carry out the 
needed research. 

There used to be more 
federally funded research on 
firearm-related violence than 
there is now. Although its program was small relative to 
other public health issues, the U.S. Centers for Disease 
Control and Prevention (CDC) did support research on 
firearm violence through its National Center for Injury 
Prevention and Control. However, the 1996 “Dickey 
Amendment” prohibited the CDC from funding activities 
that promoted or advocated for gun control. Although ar- 
guably some research might still have been considered ac- 
ceptable, the amendment was interpreted as an outright 
prohibition of CDC support for any gun violence research. 
In 2011, Congress enacted similar restrictions affecting 
the Department of Health and Human Services, resulting 
in a dearth of scientific activity on any aspect of the avail- 
ability and possession of firearms and the violence that 
might be related to them. 


“..t00 few policies and outcomes 
have been the subject of rigorous 
scientific investigation.” 


In 2013, shortly after the Sandy Hook Elementary 
School shooting in Connecticut, President Obama issued a 
series of Executive Orders directing federal science agen- 
cies to again support research on firearm-related violence. 
In response, the CDC and the CDC Foundation asked the 
Institute of Medicine (IOM) and the National Research 
Council (NRC) (now part of the National Academies of 
Sciences, Engineering, and Medicine) to draw up a public 
health research agenda that the CDC might implement. 
One of us (A.I.L.) chaired the committee that produced 
the resulting consensus report 
and research agenda.t Unfor- 
tunately, nothing came of it, 
because although CDC has 
occasionally requested funds 
to support relevant research, 
Congress consistently has said 
“no.” The National Institutes 
of Health also supported rel- 
evant research beginning in 
2013, but that program was 
discontinued in 2017 as other 
priorities replaced it. 

It’s time to stop the po- 
larized “debates” that lack 
a science base and turn our 
energies toward constructive, 
informed examination. The 
IOM-NRC report has spelled 
out a research path that calls 
for a closer examination of 
the characteristics of firearm- 
related violence; the risk and 
protective factors (like grow- 
ing up in violence-prone en- 
vironments) that increase 
the probability of firearm violence; the effectiveness of 
diverse violence prevention and other interventions; and 
the impact of various gun safety technologies. And the 
RAND analysis calls for research on the effects of specific 
firearm policies, such as whether background checks 
that investigate all types of mental health histories do re- 
duce gun injuries, suicides, and homicides and whether 
raising the minimum age for purchasing firearms (to 21 
years old) reduces firearm suicides among youth. 

Without science to drive firearm policy development 
and implementation, we risk inventing policies based on 
personal ideology or intuition. If we are serious that gun 
violence is a major public threat, then let’s rise to the mo- 
ment and take the next science-based steps. 

-Alan I. Leshner and Victor J. Dzau 


*www.rand.org/pubs/research_reports/RR2088.html; thttps://doi.org/10.17226/18319 
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Ultima Thule 


The name chosen by NASA for the next mission by the New Horizons spacecraft, to MU69, 
a remote object beyond Pluto. Thule is a mythical, far-northern island. 


I N B R I E F Edited by Jeffrey Brainard 


FUSION ENERGY 


Compact reactor vies with ITER project 


The SPARC reactor, shown in an illustration, could lower costs of fusion power. 


he Massachusetts Institute of Technology (MIT) in Cambridge 
has spun off a company to develop a compact reactor that 
could lead to practical fusion power. The Italian oil and gas 
company Eni agreed to pump $50 million into an MIT startup 
called Commonwealth Fusion Systems developing the Soonest/ 
Smallest Private-Funded Affordable Robust Compact (SPARC) 
reactor. A fusion power plant would use powerful magnets to confine 
a superheated hydrogen plasma, in which nuclei fuse and give off en- 
ergy. The U.S. government has squeezed its funding of fusion research, 
including to the international ITER reactor project being built near 
Cadarache in France. SPARC is designed to generate power in a reactor 
a fraction of ITER’s size and cost. It would rely on magnets made from 
high-temperature, copper-based superconductors that can handle the 
strong currents in a high magnetic field. The team aims to make mag- 
nets more powerful than ITER’s at a smaller scale. Commonwealth 
Fusion Systems has a rival in Tokamak Energy, a startup in Oxford, 
U.K., that has been building a similar reactor with high-temperature 
superconducting magnets (Science, 28 April 2017, p. 360). 
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Columbia neuroscientist ousted 


WORKPLACE | Prominent neuroscientist 
Tom Jessell was fired by Columbia 
University and the Howard Hughes 
Medical Institute (HHMI) in Chevy 

Chase, Maryland, earlier this month after 
a university investigation. The probe, 
Columbia said in a statement, “revealed 
serious violations of University policies and 
values governing the behavior of faculty 
members in an academic environment.” 
HHMI, which had supported Jessell’s work 
at Columbia since he arrived there in 1985, 
ousted him on 2 March, citing a violation 
of its policy. Neither Columbia nor HHMI 
would describe the nature of the violations. 
HHMI, citing personnel policy, would not 
say whether it conducted its own investi- 
gation or relied on Columbia’s findings. A 
Columbia official said Jessell’s lab, which 
listed 22 staff and students in January, 

will be disbanded over the next 15 months. 
Scores of former and current Columbia 
students and employees protested the uni- 
versity’s refusal to describe specific reasons 
for Jessell’s dismissal in an online letter to 
President Lee Bollinger and the university’s 
trustees dated 10 March. Jessell, 66, identi- 
fied mechanisms involved in the neural 
development and organization of the spi- 
nal cord, revealing principles now known 
to apply throughout the brain. 


Help wanted at NASA 


SPACE EXPLORATION | NASA’ acting 
administrator, Robert Lightfoot, who 

has led the agency since January 2017, 
announced this week that he will retire 

on 30 April. The move places pressure on 
President Donald Trump’s administration 
and the Senate to resolve a battle over lead- 
ership for the agency. Last September, the 
White House nominated Representative 
Jim Bridenstine (R-OK) to head NASA, but 
the nomination has stalled in the Senate 
because of opposition from Democrats and 
Florida’s two senators, Bill Nelson (D) 

and Marco Rubio (R). Both have said a 
“space professional” should lead NASA. 
Lightfoot’s surprise retirement could force 
action, says John Logsdon, professor emeri- 
tus at The George Washington University’s 
Space Policy Institute in Washington, D.C. 
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“The Senate should vote up or down on 
Bridenstine, or the White House should 
convince him to withdraw” and name a 
new nominee. 


Scientists face the voters 


U.S. ELECTIONS | Central Texans embraced 
a former community college math pro- 
fessor as the slew of scientists running 

for U.S. Congress this year got their first 
feedback from voters in primary elections 
last week. Mary Wilson, now a minister in 
Austin, won a plurality in the Democratic 
primary for the 21st congressional district, 
topping engineer Joseph Kopser, an Army 
veteran and clean energy entrepreneur 
from San Antonio. The two will face off 
on 22 May, with the winner vying for 

an open seat in November in the solidly 
Republican district. Houston clinical 
oncologist Jason Westin fell short in the 
seventh congressional district, finishing 
third in a seven-person race. Next week, 
the nation’s attention will shift to Illinois, 
where Representative Dan Lipinski, a con- 
servative Democrat and longtime science 
advocate, faces a serious primary chal- 
lenge from progressive businesswoman 
Marie Newman. The governor’s race is 
also worth watching: In the Democratic 
primary, state legislator Daniel Biss, who 
received a Ph.D. in theoretical mathemat- 
ics from the Massachusetts Institute of 
Technology in Cambridge, appears to be 
closing the gap against hotel heir J. B. 
Pritzker. The winner will likely face incum- 
bent Republican Bruce Radner. Follow 


WILDLIFE 


French court rules for more bears 


he French government hasn't done enough to 
protect the country’s fragile population of European 
brown bears and must pay €8000 each to two pro- 
bear lobby groups, a court in the city of Toulouse 
ruled on 6 March. Almost 40 brown bears (Ursus 
arctos) now live in the French Pyrenees, thanks in part to 
introductions from Slovenia in 1996 and 2006. But their 
presence is controversial; sheep farmers claim that bear 
attacks on their flocks are threatening their livelihoods, 
despite a system to compensate them for such losses. 
The two groups said the government had to introduce 
more bears to help out two lonely males and increase the 
population's genetic diversity. The court says France's 
failure to implement a bear conservation strategy consti- 
tutes a breach of the European Union's Habitats Directive, 
acommon conservation policy adopted in 1992. 


Veterinarians carry a brown bear captured in Slovenia for transport 


to France in 2006 to help build the population there. 
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Multiplying genomes 


The number of DNA sequences from humans and other species stored in GenBank at the U.S. National 
Center for Biotechnology Information has expanded rapidly in 25 years. By 2025, sequencing projects are 
expected to have generated at least as many data as the field of astronomy, YouTube, or Twitter—three 


other major producers of data. 
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these and other scientists running for 
Congress with our candidate tracker at 
http://scim.ag/Sciencecandidates. 


Physics popularizer suspended 


WORKPLACE | Arizona State University 
(ASU) in Tempe placed physicist and 
outspoken atheist Lawrence Krauss on 

paid administrative leave and banned him 
from campus on 6 March, on the heels of 

a 22 February BuzzFeed article in which 
several women alleged sexual misconduct 
by Krauss, including unwanted touching. 
The university says it has received no formal 
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2005 2008 2011 2014 2017 


complaints against Krauss from students, 
staff, or faculty and began an investigation 
only after BuzzFeed reporters began asking 
about him. (An Australian microbiologist 
not affiliated with ASU filed a complaint 
about Krauss with the university in 2017.) 
On 7 March, Krauss tweeted a nine-page 
rebuttal to the BuzzFeed article to his nearly 
half-million followers, saying the story 
presented “false allegations” and “absurd 
claims.” Also last week, Krauss resigned as 
chair of the board of sponsors of the Bulletin 
of the Atomic Scientists, saying he didn’t 
want the allegations to distract the organiza- 
tion. In addition, the American Humanist 
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Part of Trindade Island, 
off of Brazil's coast, would 
be protected. 


CONSERVATION 


Brazil to create marine reserves 


razil’s government plans to next week create two new marine protected areas, 
some of the world’s largest, but controversy has arisen over their boundaries. 
The preserves, centered around the remote Saint Peter and Saint Paul 
Archipelago and Trindade Island, harbor unique marine ecosystems and dozens 
of endemic species. Each of the protected areas will be about the size of Sweden, 
and together they will increase the extent of set-asides in Brazil's coastal and oceanic 
waters to 25%, up from only 1.5%. Scientists protested that boundaries of no-take 
zones within the protected areas did not appear to include the islands themselves, 
where Brazil’s navy maintains bases, and shallow reef systems around them. The navy 
says portions of the islands will be included. Scientists in Brazil want the government 
to also preserve coral reefs and other endangered environments in coastal waters, 
where conflicts with economic activities are more common than in the protected 


areas, which contain mostly open ocean. 


Association, a group in Washington, D.C., 
that represents atheists, removed Krauss 
from its pool of speakers and writers and 
said it is reviewing his 2015 Humanist of the 
Year award. 


Nerve agent in U.K. attack ID’ed 


CHEMICAL WEAPONS | The nerve agent 
used against a former Russian spy and 
his daughter was a potent binary chemi- 
cal weapon, U.K. Prime Minister Theresa 
May told Parliament this week. Binary 
nerve agents are comprised of organo- 
phosphate compounds that are relatively 
safe to handle on their own but become 
highly toxic after being mixed together. 
May identified the weapon as Novichok, 
or “newcomer,” a label for a once-secret 
class of nerve agents developed by the 
Soviet Union in the 1970s and 1980s. She 
did not reveal which Novichok was used 
in the 4 March attack in Salisbury, U.K., 
on Sergei and Yulia Skripal, who remain 
hospitalized, along with a responding 
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police officer. Unlike other nerve agents 
that are liquids and vaporize, Novichok 
compounds—some of which may be the 
most toxic nerve agents ever created—are 
believed to exist in the form of an ultrafine 
powder, says Michelle Carlin, a forensic 
chemist at Northumbria University in 
Newcastle upon Tyne, U.K. 


Science overhauled in China 


RESEARCH POLICY | China this week 
unveiled a sweeping government overhaul 
that could have major implications for 
the government’s role in science and 
public health. Chinese scientists worry 
about the plan’s call to subsume the 
Natural Science Foundation of China—the 
nation’s main source of peer-reviewed 
funding—under the science ministry, 
which has tended to take an opaque 
approach to doling out its research funds. 
Experts welcomed other aspects of the 
plan. It would shift responsibility for 
antismoking efforts from the industry 
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ministry—which regulates China’s tobacco 
monopoly—to the health ministry. That’s 
“potentially a real breakthrough moment 
for tobacco control,” says the World Health 
Organization’s Angela Pratt in Manila. And 
the consolidation of policies on pollution 
and climate change under a new Ministry 
of Ecological Environment could lead to 
more “coherent policies and regulations,” 
says Ranping Song of the World Resources 
Institute in Washington, D.C. 


Prepping for disease X 


PUBLIC HEALTH | A network of laborato- 
ries and clinics across Africa will receive a 
€10 million starting grant to help prepare 
for “disease X,” the as-yet-unknown cause 
of the next major global pandemic. In 
February, the World Health Organization 
listed disease X as one of its R&D pri- 
orities. On 11 March, a consortium of 

21 organizations from nine African and 
four European countries launched the 
African Coalition for Epidemic Research, 
Response and Training to coordinate 
clinical research, surveillance, diagnostics, 
and treatment in response to outbreaks. 
The group aims to bolster labs and clin- 
ics across the continent and coordinate 
rules and regulations across countries to 
remove some of the bureaucratic red tape 
that hampered the response to the Ebola 
outbreak in West Africa. 


Volcanic winter spared humans 


ARCHAEOLOGY | Early modern humans 

in South Africa appear to have survived 
just fine after a colossal volcanic eruption 
about 74,000 years ago spewed ash into 
the atmosphere and triggered a global 
“volcanic winter.” Some researchers have 
hypothesized that the global effects nearly 
wiped out early populations of Homo 
sapiens. But new evidence of their appar- 
ent resilience comes in a study, published 
this week in Nature, that analyzed the 
presence of cryptotephra, tiny particles of 
glass spewed by the volcano, at archaeo- 
logical sites on the coast of South Africa. 
Researchers identified a layer of volcanic 
glass, chemically traced to the explosion of 
Toba on the Indonesian island of Sumatra, 
that was probably deposited within a 
month of the explosion. Tellingly, artifacts 
found just below and directly above this 
layer showed no gap in human use of 

the site. The glass offers a powerful new 
marker to precisely date and correlate 
ancient sites across the globe. 


S SCIENCEMAG.ORG/NEWS 
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Complex behavior arose at dawn of humans 


Advanced stone tools, pigment, and extensive networks emerged as environment changed 


By Ann Gibbons 


ore than 320,000 years ago in the 

Rift Valley of Africa, some early 

innovators adopted a new technol- 

ogy: They eschewed the clunky, 

palm-size stone hand axes that 

their ancestors had used for more 
than a million years in favor of a sleek new 
toolkit. Like new generations of cellphones 
today, their Middle Stone Age (MSA) blades 
and points were smaller and more precise 
than the old so-called Acheulean hand axes 
and scrapers. 

These toolmakers in the Olorgesailie Ba- 
sin in Kenya chose as raw materials shiny 
black obsidian and white and green chert, 
rocks they had to get from distant sources 
or through trade networks. In another first, 
they chiseled red and black rocks, probably 
to use as crayons to color their bodies or 
spears—an early sign of symbolic behavior. 
“This is indicative of a gear change in behav- 
ior, toolmaking, and material culture,” says 
evolutionary psychologist Robin Dunbar at 
the University of Oxford in the United King- 
dom, who studies social networks. 

Atrio of papers released online in Science 
today documents this remarkable techno- 
logical transition. Although other sites have 
yielded MSA tools, the new, securely dated 
chronology nudges the transition back by at 
least 20,000 years, matching when our spe- 
cies, Homo sapiens, is now thought to have 
emerged. By analyzing artifacts over time 
at one site, the papers also show that these 
behaviors developed as climate swings in- 
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tensified, supporting the idea that environ- 
mental variability drove innovation. 

A team led by paleoanthropologists Rick 
Potts of the Smithsonian Institution’s Na- 
tional Museum of Natural History and 
Alison Brooks of The George Washington 
University, both in Washington, D.C., gath- 
ered artifacts from sediments spanning 
1.2 million years at Olorgesailie. Unfor- 
tunately, ancient erosion stripped away 
layers between 499,000 and 320,000 years 
ago, erasing the time when the MSA was 
probably invented. But by looking at more 
than 20,000 animal fossils associated with 
tens of thousands of stone tools, and mul- 
tiple clues to the ancient environment, 
the team provides a detailed picture of 
life before and after the transition to the 
MSA. They pin down the timing with what 
geochronologist Michael Storey of the 


Natural History Museum of Denmark in 
Copenhagen calls “very impressive” dating. 

About 900,000 years ago, a skull cap shows 
that the human ancestor H. erectus lived at 
Olorgesailie and used big Acheulean hand 
axes and scrapers to butcher meat. About 
800,000 years ago, the climate began fluctu- 
ating more intensely from wet to dry, and the 
environment became more arid and grassy. 
At about 615,000 years ago, early humans 
began to make smaller Acheulean tools they 
could carry farther, and to more carefully se- 
lect basalt as raw material, perhaps shifting 
hunting tactics in a changing environment, 
Potts says. The last hand ax at the site dates 
to 499,000 years ago, before the gap. 

By the time the archaeological record re- 
started at 320,000 years ago, the Acheulean 
tools were gone and the basin had changed 
dramatically. The wet-dry cycle was even 


The roots of modern human behavior 


Many complex technologies and signs of symbolic behavior, such as sophisticated Middle Stone Age (MSA) tools 
and pigment use, appeared first in Africa and then in fits and starts around the world. 


Olorgesailie in Kenya 

Oldest securely dated MSA tools, use of 
red and black pigment, long-distance 
transport of obsidian. 


MSA tools. 


Jebel Irhoud in Morocco 
Oldest fossils of early 
Homo sapiens, MSA tools. 


Emergence of 
Acheulean stone 
tools in Africa. 


| | 


~1.7 million ~320,000 ~300,000 years ago 


Published by AAAS 


Omo Kibish in Ethiopia 
Widely accepted fossils of H. sapiens, 


Pinnacle Point 


in South Africa 
Shellfish harvests, use 


of red ochre pigment. 


Africa, Asia, and Europe 
Shaped bone and ivory 
tools, notation systems, 
stunning figurative art. 


Skhul Cave in Israel 


Creation of perforated 
shell beads. 


r | 
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At Olorgesailie in Kenya, big hand axes (left) gave way 
to smaller, more precise blades and points (right). 


more extreme. More than 80% of mammal 
species had vanished and new kinds of el- 
ephants, pigs, foxes, and springboks gathered 
at tree-lined streams. MSA tools—relatively 
sophisticated blades and points that would 
have been hafted onto spears—were plentiful. 

The site yielded no human fossils in this 
key time frame, so researchers can’t be sure 
who the new toolmakers were. But discover- 
ies elsewhere offer a strong hint. For years 
archaeologists had thought the MSA tools 
were too old to have been made by our spe- 
cies. Then, last year, fossils resembling H. sa- 
piens were found near MSA tools and dated 
to nearly 300,000 years ago at Jebel Irhoud 
in Morocco (Science, 9 June 2017, p. 993)— 
timing that fits the Olorgesailie chronology. 

Features of the MSA tools also suggest they 
were the handiwork of sophisticated humans. 
The toolmakers were highly selective about 
their raw materials, importing obsidian from 
up to 90 kilometers away. Such far-flung con- 
nections are a “hallmark of human social 
organization, and an important buffer in for- 
ager societies,’ whose members may move to 
distant places in hard times, Brooks says. The 
tools also are smaller, more precise, and more 
uniform in shape than Acheulean tools. They 
represent a milestone in abstract thinking: A 
hand ax preserves the shape of the original 
lump of rock, but creating a blade from an 
already prepared core forces toolmakers to 
visualize its shape in advance, Brooks says. 

The team also found chunks of black rock 
and pieces of red ochre that had been punc- 
tured by sharp stone chisels. They propose 
both were used as pigments to create marks 
of individual or group identity, suggesting a 
high degree of social organization. 

Expanded social networks are “a key 
unique feature of modern humans,” says 
archaeologist Curtis Marean of Arizona 
State University in Tempe, although he sus- 
pects there will be some debate about the 
evidence for long-distance networks, noting 
that most of the stone came from just 25 to 
50 kilometers from the site. 

By combining artifacts with environmental 
data, the papers help explain what drove the 
MSA, says archaeologist Shannon McPherron 
of the Max Planck Institute for Evolutionary 
Anthropology in Leipzig, Germany. “They 
were able to use the long time-sequence at 
Olorgesailie to demonstrate how changes in 
the environment and the fauna correspond to 
the shift to the MSA.” 

But these MSA toolmakers hadn’t devel- 
oped the full package of sophisticated be- 
havior, Dunbar cautions. “It’s at the bottom 
of the scale of modern behavior,’ he says. 
“We're not talking about Salvador Dali.” & 
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INFECTIOUS DISEASE 


Nigeria hit by unprecedented 
Lassa fever outbreak 


As efforts to contain it mount, researchers are racing to find 
out what is driving this year’s surge in cases and deaths 


By Leslie Roberts 


y early January, it was clear some- 
thing “really, really extraordinary” 
was going on in Nigeria, says Lorenzo 
Pomarico of the Alliance for Interna- 
tional Medical Action (ALIMA). Cases 
of Lassa fever, a rare viral hemor- 
rhagic disease, were skyrocketing across the 
country—more were recorded in the first 
2 months of 2018 than in any previous year. 
Unprepared for a disease that has no vac- 
cines or drugs and kills 20% to 30% of those 
it sickens, eight health care workers were 
infected early on and three died. “Some- 


Ra od 


Already, Nigeria’s fragile health care sys- 
tem is overwhelmed. The one dedicated 
Lassa fever ward in the country at Irrua 
Specialist Teaching Hospital in Edo state 
has just 24 beds. Without access to proper 
training and personal protective equip- 
ment, health care workers continue to 
become infected—by now 16 cases have 
been reported, with one additional death. 

As the government and its international 
partners scramble to set up isolation wards 
and deliver protective gear to health work- 
ers, researchers on three continents are 
racing to figure out what is driving the un- 
precedented outbreak. Is it simply better 


This year, the rats that carry Lassa fever may be more numerous, or more likely to harbor the virus. 


thing was going very wrong with the out- 
break,” Pomarico says. 

Since then, the situation has only gotten 
worse. The rodent-borne disease is endemic 
in Nigeria and several other West African 
countries, fluctuating with the seasons and 
usually causing “a trickle” of cases a year, 
says Chikwe Ihekweazu, who heads the Nige- 
ria Centre for Disease Control in Abuja. But 
as of 11 March, 365 cases and 114 deaths had 
been confirmed across 19 states, with many 
more suspected. Ihekweazu says the record- 
setting figures are sure to be underestimates, 
because the disease is maddeningly hard to 
diagnose, and many cases go unreported. 


Published by AAAS 


disease surveillance in the wake of Ebola, 
the similar but more deadly disease that be- 
gan its rampage across West Africa in 2014? 
Has the virus changed in some way? Are 
there more of the rats that carry it, or are 
more of them infected? Or is another rodent 
capable of spreading the virus as well? 
“There are lots of possible explanations,” 
says Stephan Giinther, who heads the 
virology department at the Bernhard Nocht 
Institute of Tropical Medicine in Hamburg, 
Germany, and whose team has long collabo- 
rated with Nigerian researchers. Consider- 
ing how lethal Lassa fever is, shockingly 
little is known about it, he says. “We don’t 
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know why people die. We don’t know about 
the pathophysiology of the disease. We don’t 
know the point of no return.” 

That could be beginning to change. In 
2016, the World Health Organization added 
Lassa fever to its new list of priority patho- 
gens of epidemic potential, calling for more 
research. And last week, the recently cre- 
ated Coalition for Epidemic Preparedness 
Innovations, known as CEPI, awarded its 
first grant for development of a Lassa fever 
vaccine to Themis Bioscience in Vienna. 

Lassa fever was discovered in 1969, 
when two missionary nurses died of a 
mysterious disease in the remote town of 
Lassa in Borno state in northeastern Ni- 
geria. When a third nurse fell ill, she was 
evacuated to a hospital in New York City— 
along with a thermos full of blood and 
other samples from all three nurses, bound 
for Yale University’s then-new Arbovirus 
Research Unit. There, a team led by Jordi 
Casals-Ariet isolated a novel virus from the 
samples. (He, too, almost died in the pro- 
cess, saved only by an infusion of antibody- 
rich plasma from the third 
nurse, who recovered.) 

The cause is now known 


unrecognized, outside of West Africa, 
where it could be responsible for un- 
diagnosed fevers. 

Initial symptoms are easily mistaken for 
malaria or typhoid fever—body aches, sore 
throat, fever, nausea, diarrhea—before the 
disease progresses to organ failure, shock, 
and sometimes internal hemorrhaging. By 
the time doctors suspect Lassa fever, it’s of- 
ten too late to save the patient. There is no 
rapid test; accurately diagnosing the dis- 
ease requires a real-time polymerase chain 
reaction technique, but just three labs in 
Nigeria have that capability. 

For now, the only treatment is a non- 
specific antiviral drug, ribavirin. If it’s ad- 
ministered during the first 6 days of the 
illness, it seems to improve a patient’s prog- 
nosis, but “no one arrives before day 7,” 
Augier says. Nor is everyone convinced that 
ribavirin works in Lassa fever, as the only 
data come from the 1980s, Augier says. 

Several potential drugs are on the horizon, 
in addition to the vaccine. Christian Happi 
at Redeemer’s University in Ede, Nigeria, 
and the Irrua Specialist 
Teaching Hospital is de- 
veloping a rapid diagnos- 


es 
to be an arenavirus, one T used to scream tic test with colleagues at 
vathogens Tis naxural req. CUESCRCEM Cte scisiana; the 
ervoir is a multimammate Lassa is umpor tant, Broad Institute in Cam- 
mt socal bs 5 but no one listened. ise, Macrncts 
is ubiquitous across West ... Lassa fever isa Germantown, Maryland. 
Africa. Cases peak in the disease of the poor.” Happi’s group and_ its 
dry season, when farmers partners are also sequenc- 


burn the bushes in prepa- 
ration for spring plant- 
ing and rats scurry into 
houses in search of food. The rodents shed 
the virus in their urine and droppings, and 
people contract it by touching contaminated 
surfaces, inhaling viral particles, or eating 
contaminated food (including the rats). Like 
Ebola, the virus can also be spread through 
contact with bodily fluids from an infected 
person. Such human-to-human transmission 
is thought to be rare for Lassa, unlike Ebola, 
except in hospital settings without proper 
infection control. However, “The real rate of 
human-to-human transmission is unknown,” 
says Augustin Augier, secretary general of 
ALIMA in Paris, which has just launched 
a Lassa fever research program with the 
French medical institution INSERM. 

No one knows the true incidence of the 
disease. “Most cases we have found are in 
places where there are hospitals and labs,” 
Giinther says. “There is good reason to as- 
sume there are cases that are being over- 
looked.” And because the rat vector lives 
across a broad swath of the continent, 
the disease might also be endemic, but 
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Christian Happi, 
Redeemer’s University 


ing the virus “around the 
clock,” he says, and trying 
to figure out whether the 
genetic changes they have seen could have 
made it more transmissible or virulent. 

For Happi, who diagnosed Sierra Le- 
one’s first case of Ebola, the new attention 
to Lassa hasn’t come a moment too soon. 
“I used to scream and scream that Lassa 
is important, but no one listened,” he says. 
“I wrote so many grants” that were turned 
down. “Lassa fever is a disease of the poor 

. it is confined to a part of West Africa, 
and it is not viewed as a global threat.” 

As research scales up, the government 
and its partners are focusing on training 
health care workers and providing the ba- 
sics needed for infection control, as well 
as educating a frightened public about 
safety precautions. 

Pomarico, who is leading ALIMA’s emer- 
gency response to the outbreak in the two 
hardest hit states of Edo and Ondo, hopes 
cases will subside with the rains and cooler 
weather, as they usually do. “But this year 
is different. We are bracing for worst and 
preparing for the worst.” 


Published by AAAS 


QUANTUM PHYSICS 


Vibrations 
used to talk 
to quantum 
circuits 


Sound waves could supplant 
microwaves in controlling 
quantum computers 


By Adrian Cho, in Los Angeles, California 


or the moment, microwave photons are 
the keys to many quantum computers: 
Physicists use them to program, read 
out, and otherwise manipulate the ma- 
chines’ quantum bits. But microwave 
technology is bulky, and its quantum 
states don’t last very long. Now, several 
groups are exploring a new way to talk to 
a quantum computer: with tiny vibrations, 
normally carriers of pesky heat and noise. 

The budding discipline of quantum acous- 
tics could shake up embryonic quantum 
computers by miniaturizing technologies 
and producing longer-lasting quantum mem- 
ories. “We're right on the cusp” of controlling 
quantum vibrations, says Andrew Cleland, a 
physicist at the University of Chicago in Il- 
linois, whose group presented its latest work 
last week here at the annual March meeting 
of the American Physical Society. 

Whereas an ordinary computer flips bits 
that can be set to either zero or one, a quan- 
tum computer uses qubits that can be set 
to zero, one, or, bizarrely, zero and one at 
the same time—potentially enabling huge 
boosts in speed. Companies such as Google 
and IBM are racing to demonstrate the su- 
periority of quantum computers for certain 
tasks (Science, 2 December 2016, p. 1090), 
and many are betting on qubits made of 
superconducting metal circuits on chips. 

To control or read out a superconducting 
qubit, researchers make it interact with a 
microwave resonator—typically a strip of 
metal on the qubit chip or a finger-size cav- 
ity surrounding it—which rings with micro- 
wave photons the way an organ pipe rings 
with sound. By adjusting the energy of the 
qubit, researchers can shuttle its quantum 
states into the resonator, so that a zero-and- 
one state of the qubit can be stored as a state 
of the resonator in which a photon is both 
present and absent. But some physicists 
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see advantages to replacing the microwave 
resonator with a mechanical one that rings 
with quantized vibrations, or phonons. 

That effort may seem daft, as such vi- 
brations constitute heat, which obliterates 
delicate quantum states. But when work- 
ing at temperatures near absolute zero, 
a well-designed acoustic resonator could 
ring longer than a microwave one does, en- 
abling it to act as a sort of quantum mem- 
ory, says Robert Schoelkopf, a physicist at 
Yale University. The vibrations also have 
wavelengths less than a thousandth as 
long as microwaves of the same frequency, 
so the resonators can be far more compact, 
he says. 

First, physicists must learn to control 
quantum vibrations. They took a first step 
in 2010, when Cleland, then at the Univer- 
sity of California, Santa Barbara (UCSB), si- 
phoned every phonon out of an oscillating 
cantilever etched from aluminum nitride, 
leaving it in its least energetic quantum 
ground state. However, that simple quan- 
tum state persisted for just 5 nanoseconds, 
too little time to put the device into more 
complex quantum states of motion. 

To push further, several groups are ma- 
nipulating ripples called surface acoustic 
waves (SAWs), which travel along a mate- 
rial’s surface. On top of a microchip, the 
researchers etch two gratings of metal 
stripes just micrometers apart. In the gap 
between the gratings, the researchers trig- 
ger a wave by applying a voltage to a comb- 
shaped device called a transducer, which 
causes the material to contract. The grat- 
ings act as mirrors, reflecting SAWs of par- 
ticular wavelengths back and forth so that 
they resonate in the gap. And by connect- 
ing the transducer to a superconducting 
qubit, researchers link its quantum state 
to the SAWs. 
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Using that approach, Cleland and Kevin 
Satzinger, a UCSB graduate student, fash- 
ioned a resonator on a lithium niobate chip 
that rang for up to 150 nanoseconds. They 
showed they could create any desired com- 
bination of zero and one phonons in the 
resonator, Satzinger told the meeting. “We 
can watch the energy going back and forth” 
between qubit and cavity, he says. 

Researchers in Schoelkopf’s group are 
focusing not on waves trapped on a chip’s 
surface, but on vibrations traveling through 
the chip’s bulk material. They exploit vibra- 
tions that can bounce between the upper 
and lower surfaces of the half-millimeter- 
thick chip beneath their qubit. 

Using that geometry, the research- 
ers kept vibrations in their sapphire chip 
ringing for up to 60 microseconds, Yale’s 
Yiwen Chu told the meeting. Moreover, the 
researchers could feed up to seven quanta 
of vibration one by one into the resonator, 
she reported. Making more complex quan- 
tum states is “really the next step,’ Chu 
says. For example, she says, they might try 
to put the resonator into a Schrédinger cat 
state, in which it would contain a macro- 
scopic sound wave, comprising many vi- 
brational quanta, and at the same time be 
devoid of vibrations. 

Acoustic resonators could offer more flex- 
ibility in quantum circuit designs. In some 
circuits, multiple qubits are linked to the 
same microwave resonator, which acts as 
a conduit for the qubit interactions. But 
most microwave cavities can only host pho- 
tons of a single frequency. In that case, all 
the qubits must interact with one another 
in an interconnected tangle, says Konrad 
Lehnert, a physicist at JILA, an institute 
run jointly by the University of Colorado in 
Boulder and the National Institute of Stan- 
dards and Technology. 
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Micrometer-size acoustic 
resonators (orange) 

trap vibrations that can 
control quantum bits. 


In contrast, acoustic resonators can en- 
able qubits to interact with vibrations of 
a few different, closely spaced frequencies. 
That should make it possible to tailor the 
interactions among the qubits, so that, for 
example, only nearest neighbors interact— 
desirable for modeling certain abstract 
quantum systems, Lehnert says. He and his 
colleagues have taken a step toward such 
control, JILAs Bradley Moores told the 
meeting, by showing that they could simul- 
taneously couple a single qubit to SAWs of 
several frequencies. 

Quantum acoustics might also help solve 
a major problem for emerging quantum 
technologies. Microwave cables can shuttle 
information within a quantum computer. To 
move it to other experiments or distant loca- 
tions, however, those signals will likely need 
to be converted from microwaves to optical 
photons, which can travel great distances 
in optical fibers. Acoustic waves rippling at 
microwave frequencies have wavelengths 
similar to those of optical photons. So in prin- 
ciple, they could serve as a bridge to translate 
between the two, researchers say, although 
nobody yet knows exactly how to do it. 

Acoustic resonators might even test the 
bounds of the quantum realm. Quantum 
theory allows tiny things like atoms or pho- 
tons to be in two places at once, but nobody 
has ever seen such behavior in a macroscopic 
material object. Some theorists argue that a 
yet-unknown principle, perhaps involving 
gravity, would prevent it from happening 
for large objects. But Chu says it might be 
possible to make her group’s sapphire chip 
simultaneously vibrate in opposite direc- 
tions. That would put tens of micrograms 
of material in two slightly different places 
at the same time—and test whether quan- 
tum weirdness extends almost to the human 
scale. “You don’t know until you try.” 
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GERMANY 


Max Planck Society, at a 
crossroads, seeks new leaders 


A wave of retirements offers a chance to recruit female 
institute directors and open up new research avenues 


By Kai Kupferschmidt, in Berlin 


rin Schuman and her husband Gilles 

Laurent left their jobs at the Califor- 

nia Institute of Technology in Pasa- 

dena a few years ago and moved their 

family, cat, and dog halfway across the 

globe to Frankfurt, Germany. It took 
some courage. She’s American, he’s French, 
and two of their daughters were in school; 
resettling in Europe was “certainly chal- 
lenging,” Schuman says. 

But the move came with huge professional 
benefits. Schuman and her husband, both 
neuroscientists, moved to the Max Planck 
Institute for Brain Research as “directors,” 
prestigious positions as the heads of large 
research groups that come with guaranteed 
funding until retirement and full academic 
freedom. They also had the opportunity to 
design a brand new lab. (“We worked with 
the architects to maximize the interactions 
and happiness of our colleagues,” Schuman 
says.) The move turned out to be a “tremen- 
dously enriching experience.” 

Many scientists may soon have a similar 
chance. The Max Planck Society (MPG) re- 
cently took out ads in major scientific journals 
to recruit 20 new directors in fields rang- 
ing from astrophysics to terrestrial micro- 
biology, in one of its biggest talent searches 
ever. Between now and 2030, roughly 200 of 
the 300 director posts at the 84 institutes will 
become vacant, says an MPG spokesperson. 
Many directors are coming up for retirement 
at the 20 institutes opened in the eastern part 
of the country after the German reunification 
in 1990, and so are many of their counterparts 
at older institutes in former West Germany. 

For the society it marks a turning point: 
a chance to hire more women and more for- 
eign researchers, and an opportunity to open 
up entirely new fields of research, because 
incoming directors can set their own course. 
(There are also plans to launch completely 
new institutes on cybersecurity and the ori- 
gins of life.) But it’s also a huge challenge. 
Finding directors is a lengthy process, and 
competition from other institutions seeking 
top talent—especially women—is fierce. And 
some argue that the society should use the 
opportunity to rethink some of its traditions. 
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Founded in 1948, MPG has a €1.8 billion 
annual budget and is Germany’s leading re- 
search powerhouse. The Nature Publishing 
Index ranks it as the fourth largest contrib- 
utor to high-quality research in the world; 
its researchers have won 18 Nobel Prizes. 

Most of the institutes are led jointly by 
three to five directors who each also run 
their own research departments. As an 
MPG director, “You have an incredible lib- 
erty to research what you want, even chang- 
ing your field if you like” says Christiane 
Niisslein-Volhard, who won a Nobel Prize in 
Physiology or Medicine as a director at the 
Max Planck Institute for Developmental 
Biology in Tiibingen in 1995. There are pe- 
riodic evaluations, but a poor result means 
losing only a fraction of your funding, says 
Schuman, who previously held one of the 


“You have an incredible 
liberty to research what you 
want, even changing your 
field if you like.” 


Christiane Nisslein-Volhard, Max Planck 
Institute for Developmental Biology 


plum positions in U.S. science: as an inves- 
tigator funded by the Howard Hughes Medi- 
cal Institute on a 5-year contract. “I did not 
realize how the renewal clock of 5 years dis- 
suaded me from going for risky ideas until 
I became a [Max Planck] director,’ she says. 

When a director leaves or retires, MPG 
doesn’t just look to “fill a certain slot,’ says 
Jiirgen Renn, director of the Max Planck 
Institute for the History of Science here. 
Instead it seeks excellent scientists and re- 
shapes the institute’s mission around them, 
sometimes altering it completely. When 
the directors of the Max Planck Institute of 
Economics in Jena left within a few years of 
each other, for instance, prominent econo- 
mists proved hard to lure to that city. So 
MPG in 2014 refashioned it into the Max 
Planck Institute for the Science of Human 
History, which has since done pioneering 
work on the history of plague. 
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The hiring process is arduous. Tradition- 
ally, an institute’s current directors identify 
potential candidates, who are then reviewed 
by outside referees. (The recent ads were a 
new addition.) Settling on a new director can 
take years. The society currently fills about 
10 positions a year, so the rush of openings 
may be a challenge. And the “explosion of re- 
search at universities all around the globe” 
means that competition “is a lot more in- 
tense,” says U.S. economist David Audretsch, 
a former director at the Jena institute. 

Finding women is especially hard. The 
number of female directors has gone up 
from 4.5% in 2005 to about 15% today; MPG 
aims to reach 18% by 2020. (Among recent 
catches is gene-editing pioneer Emmanuelle 
Charpentier.) But a recent MPG report men- 
tioned an international “bidding war” for se- 
nior female talent. To expand the pool, female 
scientists should receive much more support 
early in their careers, Renn says. One effort 
is MPQ@’s Lise Meitner excellence program, 
launched late last year, which will establish 
up to 10 new research groups led by women 
every year. Recruiting directors at a younger 
age could also help, Schuman says: “I know 
this younger pool has lots of qualified and 
amazing women who can join our ranks.” 

Attracting international talent is another 
challenge. Under Germany’s federal system, 
MPG institutes are spread around the coun- 
try, from big cities like Berlin or Munich 
all the way to places such as Pl6n, a small 
lakeside resort in the north, that are less at- 
tractive to foreigners who want an interna- 
tional school or an airport close by. 

The biggest drawback of MPG institutes is 
their “outdated” isolation from universities, 
says Thomas Siidhof, a biochemist and Nobel 
laureate at Stanford University in Palo Alto, 
California, who was an MPG director in the 
1990s but left after falling out with Hubert 
Markl, then MPG@’s president. “Science is a 
team effort and it’s important who you meet 
in the hallway or in the cafeteria,’ Siidhof 
says. “The Max Plank Society would profit 
enormously if they integrated their institutes 
directly into universities.” Schuman says he 
has a point, but warns that integration might 
lead to a “class system” within universities 
that favors Max Planck appointees. MPG is 
trying another solution: bringing institutes in 
the same city together in campuslike clusters. 

As to filling the wave of new director 
posts, “I think we should dare to be riskier,” 
Schuman says. “We should not be afraid of 
failing, but rather expect that some failures 
will accompany our riskier choices, which 
could lead to game-changing discoveries.” In 
other words, MPG may just need a bit more 
of the courage it takes for an individual scien- 
tist to pull up stakes and move their lab and 
family to another continent. 
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INFECTIOUS DISEASE 


Concern as HIV prevention strategy languishes 


Few countries promote prophylactic drug regimen, and many people shun it 


By Jon Cohen, in Boston 


here’s no question that a simple regi- 

men of a single daily pill can slash HIV 

infections in people at risk. But al- 

though millions of people around the 

world could benefit, only 200,000 are 

prescribed what’s called pre-exposure 
prophylaxis, or PrEP, and 75% of them are 
in the United States. The disparity was the 
focus of anguished discussions last week at 
the largest annual U.S. HIV/AIDS conference. 
“Tm frustrated,’ says Linda-Gail Bekker, a re- 
searcher at the University of Cape Town in 
South Africa. “We have nearly 1700 infections 
every week in young women and girls, and 
this might be a way to throttle this.” 

PrEP, which uses antiretroviral (ARV) pills 
made for treatment as preventives, faces mul- 
tiple obstacles, as Bekker and other discussed 
at the Conference on Retroviruses and Op- 
portunistic Infections here. Many countries 
already struggle to provide ARVs to people 
who are infected and say they can’t afford 
PrEP for the large uninfected populations 
who need it most. People at substantial risk 
of becoming infected by HIV may not know 
about PrEP or may reject it, as several presen- 
tations highlighted. And PrEP raises touchy 
issues including teen sexuality and the ap- 
pearance of advocating risky behavior. 

Yet PrEP works. Its effectiveness in sev- 
eral clinical trials led the U.S. Food and Drug 
Administration to approve preventive use of 
ARVs in 2012. The World Health Organiza- 
tion in 2015 recommended PrEP for everyone 
at “substantial risk” of becoming infected—a 
total of tens of millions of people, to judge 
from the nearly 2 million new HIV infections 
in 2016. In some countries, including the 
United States, the targeted groups are “key 
populations” such as men who have sex with 
men (MSM) and sex workers. In sub-Saharan 
Africa, where most people become infected 
from heterosexual sex and many have no 
“high-risk” behavior, large swaths of the pop- 
ulation are candidates for PrEP. 

Two reports at the meeting underscored 
its power. In San Francisco, California, the 
number of people using PrEP more than tri- 
pled from 2013 to 2016, to 12,300. Research- 
ers found that the city’s new HIV diagnoses 
dropped by 43% during that period, although 
they note that it isn’t easy to disentangle the 
impact of PrEP from other factors. In Austra- 
lia, the state of New South Wales launched 
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a PrEP project that began enrolling MSM 
in 2016. As Andrew Grulich, an epidemio- 
logist at the University of New South Wales in 
Sydney, reported, more than 9000 MSM have 
now started PrEP. In the first half of 2017, 
Grulich says the team saw a “very substantial 
decline” in new infections—a drop of 35%. 
Only one hard-hit country in sub-Saharan 
Africa aggressively promotes PrEP—Kenya, 
where 13,000 people are now using it. South 
Africa has an HIV-infected population es- 
timated at 7.2 million, more than any other 
country, and yet only 5000 or so uninfected 
people are using PrEP. At a cost of a few hun- 
dred dollars a year per person for drugs and 
monitoring, PrEP would add to the huge eco- 
nomic burden that the country faces in trying 
to treat all infected people, Bekker acknowl- 


has one of the highest new infection rates 
in the country—in Washington, D.C., and 
Baltimore, Maryland, found that 86% knew 
about PrEP. But only 39% had tried it, in part 
because of unfounded concerns about inter- 
actions with hormones they were taking. 
Many researchers and affected communi- 
ties also worry that PrEP could backfire if it 
increases risk-taking behavior such as having 
more partners or not using condoms. But 
studies have not supported these concerns. 
Francois Venter of the University of the 
Witwatersrand in Johannesburg, South Af- 
rica, who has tested PrEP in sex workers, says 
his government is squeamish about offer- 
ing it to teens, who might benefit the most. 
Ideally, he says, PrEP would be promoted 
in school sex education classes. But he can’t 


Poorly PrEPared 


Only the United States, Australia, and Kenya aggressively promote pre-exposure prophylaxis (PrEP). 
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edges. But she says the government is also 
wed to a flawed analysis that suggested PrEP 
wouldn’t be cost effective outside the highest 
risk groups such as sex workers and MSM. 
Even the United States was slow to adopt 
PrEP, and several studies at the meeting ex- 
plored why. Many people do not recognize 
that they are at high risk of infection. Others 
worry about side effects—long-term use of the 
drugs can cause bone and kidney problems. 
Some simply prefer condoms or don’t want 
to take a daily pill. A study in four southern 
US. cities found that of 72 women who were 
deemed “PrEP eligible,’ only four had ever 
heard of it. Another study of 201 transgender 
black and Latina women—a population that 
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see that happening in South Africa’s “inher- 
ently conservative society” any time soon. 
“Everyone wrings their hands and whinges 
out about young girls, but we can’t get into 
schools,” Venter says. “It’s a political and pro- 
grammatic hot potato.” Ultimately, he adds, 
PrEP should be treated like contraception. 
“You should be able to walk in and buy it over 
the counter.” 

Nelly Mugo, an OB-GYN from the Ke- 
nya Medical Research Institute in Nairobi, 
stresses the need to listen to and address 
the concerns of PrEP skeptics. “We shouldn’t 
be discouraged by low initial uptake,’ Mugo 
says. “Maybe we need the slow period for 
people to be certain the products are good.” 


16 MARCH 2018 * VOL 359 ISSUE 6381 1205 


8L0Z ‘GL youey UO /Bio bewaousIos 90uUaINS//:dyy Wold papeojuUMOG 


NEWS 


MASTER 


PLANNER 


China’s revered rocket scientist, 
Qian Xuesen, set in motion a system for 
engineers to control Chinese society 


By Mara Hvistendahl, in Shanghai, China 


ts rare that a scientist becomes a 
folk hero. But in China, Qian Xuesen 
draws crowds almost a decade after 
his death. On a Saturday morning in 
a three-story museum here, tourists 
admire Qian’s faded green sofa set, the 
worn leather briefcase he carried for 
4 decades, and a picture of him shak- 
ing hands with opera star Luciano 
Pavarotti. They file past a relic from a turn- 
ing point in Qian’s life—and in China’s rise 
as a superpower: a framed ticket from his 
1955 voyage from San Francisco, California, 
to Hong Kong in China aboard the SS Presi- 
dent Cleveland. Once a professor at NASA’s 
Jet Propulsion Laboratory (JPL) in Pasa- 
dena, California, he had been accused of 
having communist sympathies in the heat 
of the Red Scare and placed under virtual 
house arrest. Upon his release, he and his 
family set sail for his motherland. 

After arriving in China, Qian went on 
to spearhead the rapid ascent of the coun- 
try’s nuclear weapons program, an achieve- 
ment that explains some of the adulation. 
But his legacy is still unfolding in a second 
area that could have great consequences for 
China—and for the world. Qian, who died in 
2009 at the age of 97, helped lay the ground- 
work for China’s modern surveillance state. 

Early in his career, he embraced sys- 
tems engineering—an  interdisciplinary 
field focused on understanding the general 
properties common to all physical and so- 
cietal systems, and using that knowledge 
to exert control. By mapping a system’s 
dynamics and constraints, including any 
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feedback loops, systems theorists learn 
how to intervene in it and shape outcomes. 
Since the field’s founding in the 1950s, 
systems approaches have been applied to 
areas as varied as biology and transporta- 
tion infrastructure. 

In the West, systems engineering’s hey- 
day has long passed. But in China, the dis- 
cipline is deeply integrated into national 
planning. The city of Wuhan is preparing to 
host in August the International Conference 
on Control Science and Systems Engineer- 
ing, which focuses on topics such as au- 
tonomous transportation and the “control 
analysis of social and human systems.” Sys- 
tems engineers have had a hand in projects 
as diverse as hydropower dam construction 
and China’s social credit system, a vast ef- 
fort aimed at using big data to track citi- 
zens’ behavior. Systems theory “doesn’t just 
solve natural sciences problems, social sci- 
ence problems, and engineering technology 
problems,” explains Xue Huifeng, director 
of the China Aerospace Laboratory of Social 
System Engineering (CALSSE) and president 
of the China Academy of Aerospace Systems 
Science and Engineering in Beijing. “It also 
solves governance problems.” 

The field has resonated with Chinese Presi- 
dent Xi Jinping, who in 2013 said that “com- 
prehensively deepening reform is a complex 
systems engineering problem.” So important 
is the discipline to the Chinese Communist 
Party that cadres in its Central Party School 
in Beijing are required to study it. By apply- 
ing systems engineering to challenges such as 
maintaining social stability, the Chinese gov- 
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ernment aims to “not just understand reality 
or predict reality, but to control reality,’ says 
Rogier Creemers, a scholar of Chinese law at 
the Leiden University Institute for Area Stud- 
ies in the Netherlands. 

With the discipline now touted at the 
highest levels of government, Qian has 
been deified, with biographies, television 
segments, and symposiums regularly de- 
voted to him. In the 1990s, the Chinese 
government even spearheaded a “learn 
from Qian Xuesen” movement. Popular 
discourse now acknowledges that modern 
China’s first leader, Mao Zedong, “was a 
human being,” says Zhichang Zhu, a sys- 
tems scientist at the Xiamen University 
Malaysia in Sepang. “But to a circle of sci- 
entists in China, Qian Xuesen is now, in 
their mind, the new god.” 

The traditional, or “hard,” brand of sys- 
tems engineering that Qian pioneered has 
lately come under attack from Zhu and 
other scholars, both inside and outside 
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China. They contend that it discounts the 
experiences of everyday people affected 
by systems models and values state power 
above all else. They are trying to carve out 
an alternative vision for systems science, 
one less reliant on mathematical formulas 
and more attuned to civic participation. 
But that could prove an uphill battle in a 
country where maintaining stability trumps 
scholarly debate. 


IN A BUILDING flanked by military guards, 
systems scientists from CALSSE sit around a 
large conference table, explaining to Science 
the complex diagrams behind their studies 
on controlling systems. The researchers 
have helped model resource management 
and other processes in smart cities powered 
by artificial intelligence. Xue, who oversees 
a project named for Qian at CALSSE, traces 
his work back to the U.S.-educated scientist. 
“You should not forget your original start- 
ing point,” he says. 
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Qian was born in 1911 in Hangzhou, 
in eastern China. In 1935, a scholarship 
brought him to the Massachusetts Insti- 
tute of Technology in Cambridge. He then 
went on to the California Institute of Tech- 
nology in Pasadena, where he worked with 
the Hungarian mathematician Theodore 
von Karman. When Von Karman and oth- 
ers founded JPL in 1944 to develop rocket 
technology, Qian was given a security clear- 
ance and brought on to work on classified 
weapons research. 

As the Red Scare took hold in the 1950s, 
the scientist came into the Federal Bureau 
of Investigation’s crosshairs. His security 
clearance was revoked, and after years of bi- 
lateral negotiations, Qian was allowed to re- 
turn to China. Back in Beijing, his previous 
experience designing complicated weapons 
systems and rockets for the United States 
became integral to China’s budding efforts. 
On 16 October 1964, at 3 p.m. local time, 
China detonated its first atomic bomb. Xue 
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A bust of Qian Xuesen at a museum in Shanghai, 
China, devoted to the revered scientist. In addition to 
designing sophisticated weapons systems, Qian laid 
the groundwork for China’s surveillance system. 


says the program succeeded in part because 
Qian modeled a complex weapons system 
down to its most unpredictable parts. He 
automated China’s weapons command and 
control system, enabling planners to direct 
the activities of thousands of people at once. 

As Qian honed China’s weapons systems, 
scientists in North America and Europe be- 
gan applying systems approaches to intrac- 
table policy problems, modeling them as a 
collection of inputs and variables linked by 
direct or inverse relationships and feedback 
loops. In the 1960s, for example, school 
districts across California tried systems ap- 
proaches. To help educators set budget pri- 
orities, multimillion-dollar data processing 
programs designed by JPL and the rocket 
manufacturer Aerojet General Corporation 
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collated children’s academic records, IQ 
scores, and attendance. 

According to critics, such efforts wasted 
money that could have gone toward hiring 
teachers and reduced to rational analysis 
what should have been a complex political 
process. “When the policymakers came in 
and started asking questions, [they] were 
talking about variables that weren’t in the 
models,” says Gerald Midgley, a systems sci- 
entist at the University of Hull in the United 
Kingdom. California’s data-driven approach 
to education was eventually scrapped. 
Those and other embarrassments brought 
the field into disrepute in the West. 


BACK IN CHINA, though, the notion that sci- 
entists could neatly model societal endeav- 
ors resonated with leaders reared on central 
planning. An early major contribution of so- 
cial systems scientists occurred in the late 
1970s, when Qian’s protégé, missile scien- 
tist Song Jian, led a team whose computer- 
generated projections showed China’s pop- 
ulation rising to 4 billion by 2080 (Science, 
17 September 2010, p. 1460). That work 
helped justify extreme restrictions on births 
after the government implemented the one- 
child policy in 1980. 

Soon after, systems scientists began as- 
sessing the feasibility of building the titanic 
Three Gorges Dam on the Yangtze River. 
The goal was to determine the optimal dam 
height and water level in the reservoir, bal- 
ancing the demands of power generation 
with other factors, including the massive 
project’s negative impacts. One group work- 
ing on the project took stock of 14 “sub- 
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A traffic officer walks past a facial recognition screen at an intersection in Shanghai, China. The Chinese government uses systems engineering to monitor crowd behavior. 


systems,” including geology, ecology, and 
human migration. The researchers then 
analyzed how various water levels would af- 
fect outputs such as seismic activity or the 
number of people forced to relocate. Ulti- 
mately, the group arrived at an ideal water 
impoundment level, 175 meters. The dam’s 
operator hewed to that advice, raising water 
levels to 175 meters by 2012. 

As with the one-child policy, though, the 
systems scientists entrusted with studying 
the Three Gorges Dam devoted little time to 
consulting people whom the project would 
affect most. (Dam building and the reser- 
voir that formed behind the structure dis- 
placed 1.3 million people in southwestern 
China.) Because the dam’s construction was 
a foregone conclusion, the feasibility study 
was limited to outcomes that reinforced 
government plans. Researchers in China 
often approach megaprojects like Three 
Gorges “from the perspective of how to 
successfully implement the project whose 
execution has already been decided politi- 
cally,’ says Yoshiteru Nakamori, a systems 
scientist and former dean of the School of 
Knowledge Science at Japan Advanced In- 
stitute of Science and Technology in Nomi. 

More recently, the involvement of China’s 
systems scientists in designing the coun- 
try’s digital infrastructure has raised simi- 
lar questions about whether the scientists 
are aiding the state at the expense of the 
public. Take China’s smart cities initiative. 
The Chinese government claims to have 
wired hundreds of cities with sensors that 
collect data on topics including city service 
usage and crime. At the opening ceremony 
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of China’s 19th Party Congress last fall, Xi 
said smart cities were part of a “deep inte- 
gration of the internet, big data, and artifi- 
cial intelligence with the real economy.” 

The initiative, which has received fund- 
ing from the United Nations Development 
Programme, has benign components. Xue 
and colleagues, for example, are working 
on how smart cities can manage water re- 
sources. In Guangdong province, the re- 
searchers are evaluating how to develop 
a standardized approach for monitoring 
water use that might be extended to other 
smart cities. 

But Xue says that smart cities are as 
much about preserving societal stability as 
streamlining transportation flows and miti- 
gating air pollution. Samantha Hoffman, a 
consultant with the International Institute 
for Strategic Studies in London, says the 
program is tied to long-standing efforts to 
build a digital surveillance infrastructure 
and is “specifically there for social control 
reasons” (Science, 9 February, p. 628). The 
smart cities initiative builds on 1990s sys- 
tems engineering projects—the “golden” 
projects—aimed at dividing cities into geo- 
graphic grids for monitoring, she adds. 

Layered onto the smart cities project is 
another systems engineering effort: China’s 
social credit system. In 2014, the country’s 
State Council outlined a plan to compile 
data on individuals, government officials, 
and companies into a nationwide track- 
ing system by 2020. The goal is to shape 
behavior by using a mixture of carrots and 
sticks. In some citywide and commercial 
pilot projects already underway, individu- 
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als can be dinged for transgressions such 
as spreading rumors online. People who 
receive poor marks in the national system 
may eventually be barred from travel and 
denied access to social services, according 
to government documents. 

Civil liberties groups charge that the 
system will deepen monitoring of the citi- 
zenry, especially if combined with the Chi- 
nese state’s growing biometrics capabilities 
(Science, 21 September 2012, p. 1448). Social 
credit is aimed at “further tightening the 
web of social control,” says Maya Wang, a re- 
searcher with Human Rights Watch in Hong 
Kong. (The Chinese government maintains 
that the system is about building trust and 
accountability as well as helping law 
enforcement identify criminals.) 

Government documents refer to 
the social credit system as a “social 
systems engineering project.’ Details 
about which systems engineers con- 
sulted on the project are scant. But 
one theory that may have proved 
useful is Qian’s “open complex giant 
system,” Zhu says. A quarter-century 
ago, Qian proposed that society is a 
system comprising millions of sub- 
systems: individual persons, in hu- 
man parlance. Maintaining control in 
such a system is challenging because 
people have diverse backgrounds, 
hold a broad spectrum of opinions, 
and communicate using a variety of 
media, he wrote in 1993 in the Jowr- 
nal of Systems Engineering and Elec- 
tronics. His answer sounds like an 
early road map for the social credit 
system: to use then-embryonic tools 
such as artificial intelligence to col- 
lect and synthesize reams of data. 

According to published papers, 
China’s hard systems scientists also use 
approaches derived from Qian’s work to 
monitor public opinion and gauge crowd 
behavior. And systems science approaches 
are on display in Xinjiang, a region in 
northwest China with a high percentage 
of Muslims. According to Human Rights 
Watch, Xinjiang’s public security bureau 
is aggregating data from sources such as 
closed-circuit TV cameras, security check- 
points, and residents’ networked devices. 
Authorities then use a form of systems 
analysis adapted from People’s Liberation 
Army doctrine to flag people seen as poten- 
tially disruptive. 


AFTER SYSTEMS ENGINEERING fell from 
grace in the West, researchers spearheaded 
a fundamentally new approach. In 1981, 
Peter Checkland of Lancaster University in 
the United Kingdom called for a “soft” sys- 
tems science that valued input from stake- 
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holders over mathematical modeling. In 
Checkland’s vision, experts exist not to im- 
pose their values, but instead to learn from 
people involved in the problem at hand. 
When Zhu left China in 1988 to earn a mas- 
ter’s degree at the University of Hull, that 
was the approach he encountered. Zhu 
studied there under Midgley, just as systems 
engineering was transforming in the West. 
A few years later, Zhu began collabo- 
rating with Gu Jifa in Beijing, who had 
worked on the Three Gorges Dam assess- 
ment and become head of the Systems En- 
gineering Society of China. Zhu had come 
to see Qian’s brand of systems science as 
“brain without soul.” Gu, meanwhile, had 


fe asi 


learned firsthand the importance of what 
he called renli, or human relations, in 
shaping project outcomes. For example, 
his recommendations for an urban devel- 
opment plan in Beijing were not adopted 
because his team neglected to engage key 
stakeholders. Hard systems engineering 
worked well for rocket science, but not for 
more complex social problems, Gu says: 
“We realized we needed to change our ap- 
proach.” He felt strongly that any methods 
used in China had to be grounded in Chi- 
nese culture. 

The duo came up with what it called the 
WSR approach: It integrated wuli, an in- 
vestigation of facts and future scenarios; 
shili, the mathematical and conceptual 
models used to organize systems; and 
renli. Though influenced by U.K. systems 
thinking, the approach was decidedly 
eastern, its precepts inspired by the em- 
phasis on social relationships in Chinese 
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culture. Instead of shunning mathematical 
approaches, WSR tried to integrate them 
with softer inquiries, such as taking stock 
of what groups a project would benefit or 
harm. WSR has since been used to calcu- 
late wait times for large events in China 
and to determine how China’s universities 
perform, among other projects. 

Despite the efforts of Gu and others, sys- 
tems science in China today remains rooted 
in the hard systems engineering approaches 
that rocket scientists pioneered decades 
ago. That emphasis is apparent at the cin- 
der block apartment in Beijing where Qian 
lived for nearly a half-century, which is now 
an unofficial museum for state visitors. 


China’s systems engineers helped determine the optimal water level for the controversial Three Gorges Dam. 


Giving a tour of the apartment, Qian 
Yonggang, Xuesen’s 69-year-old son, ges- 
tures to a living room decorated with blond 
wood accents. “Many Chinese leaders have 
sat here,” he says. He moves on to the bed- 
room, indicating the twin bed in which his 
father spent his final days. Hanging above 
it is a framed photo of a somber-looking 
Qian Xuesen. 

Zhu contends that the time has come to 
bring the god down to Earth. He recently 
wrote that systems science in China is “un- 
der a rationalistic grip, with the ‘scientific’ 
leg long and the democratic leg short.” Zhu 
says he has no doubt that systems scientists 
can make projects such as the social credit 
system more effective. However, he cau- 
tions, “Systems approaches should not be 
just a convenient tool in the expert’s hands 
for realizing the party’s wills. They should 
be a powerful weapon in people’s hands for 


29 


building a fair, just, prosperous society.” & 
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PHYSIOLOGY 


Rhythms: The dark side meets the light 


Researchers examine the 24-hour biological clock in daytime-active baboons 


By Arthur Millius' and Hiroki Ueda’ 


n internal biological rhythm, the 
circadian clock—which can be mea- 
sured by changes in rhythmic gene 
expression, cellular activity, or 
physiological behavior—enables an 
organism to anticipate daily cyclic 
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changes in the environment. For years, 
researchers have examined mammalian 
behavioral and physiological rhythms pri- 
marily in nocturnal species such as mice 
and rats; but in what ways are the genes, 
neural circuits, and rhythms of nocturnal 
animals related to those of humans, who 
(mostly) operate during the day? On page 
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1232 of this issue, Mure et al. (1) examine 
our close, evolutionarily related cousin— 
the wild olive baboon (Papio anubis) (see 
the photo)—to understand the internal 
clock of a diurnal primate, which has impli- 
cations for understanding human biologi- 
cal rhythms and chronotherapy (treatment 
with drugs based on the time of day). 
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The internal oscillations of the wild olive baboon may 
provide insights into human biological rhythms. 


A vast empire of knowledge on mamma- 
lian biological rhythms is built primarily on 
research on nocturnal rodents, but how does 
this information translate to animals that 
are active during the day? Nocturnal and 
diurnal rodents both have similar brain tis- 
sues, neurotransmitters, and neuronal pace- 
maker cells (2-4), and diurnal rodents also 
have rhythms in a few core clock genes and 
proteins (5, 6). Researchers have sampled 
various tissues throughout the day in a noc- 
turnal mouse and have reported that about 
half of all genes have oscillatory expression 
(7). This suggests that widespread rhythmic 
changes in gene expression accompany daily 
changes in activity, from getting sleepy to 
feeling hungry. However, there is no 
corresponding timetable of gene ex- 
pression in neuronal and peripheral 
tissues of a diurnal organism, or one 
closely related to humans, so that we 
can begin to draw connections be- 
tween the timing of gene expression, 
cellular function, and behavior. 

Mure e¢ al. addressed this gap by ex- 
amining physiology and gene expres- 
sion every 2 hours over a 24-hour period in 
the daytime-active wild olive baboon and 
found that nearly 82% of transcripts had 
rhythmic oscillation in at least one tissue. 
This means that time-dependent gene ex- 
pression is not a paradigm limited to a few 
core genes that regulate the circadian clock 
but rather is a pervasive element of primate 
physiology. In fact, about half of the most 
prevalent drugs in the United States target 
the products of rhythmic genes (7), which 
provides an opportunity to increase drug ef- 
ficacy by timing delivery with their expres- 
sion (chronotherapy). 

Approximately 11,000 transcripts were ex- 
pressed in all 64 sampled tissues, which the 
researchers called ubiquitously expressed 
genes, including many involved in basic 
cellular functions such as DNA repair, tran- 
scription, and protein homeostasis. Most 
of these ubiquitously expressed genes were 
rhythmic in at least one tissue, but there 
was little overlap in rhythmic genes between 
tissues, which suggests that tissue-specific 
mechanisms control oscillatory expression. 
For example, a gene that had rhythmic ex- 
pression in the liver was constitutively ex- 
pressed in the heart. Because ubiquitously 
expressed genes control fundamental bio- 
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logical processes, timing their expression 
can affect the overall function of a tissue. For 
example, diurnal regulation of exocytosis in 
the thyroid or adrenal glands may enable 
rhythmic release of endocrine factors, com- 
pared with other organs in which the timing 
of exocytosis is less important for function. 
Only a few genes had rhythmic expression 
in more than one tissue—including many in- 
volved in DNA processing, metabolism, and 
circadian rhythms—but there were a few 
surprises. For example, the core clock tran- 
scriptional repressors cryptochrome 1 and 
cryptochrome 2 were expressed at unequal 
levels in different tissues. This suggests that 
each tissue has a distinct composition of cir- 
cadian clock components, like an hourglass 
counting time in sand or salt. Nowhere was 
the disparity in tissue-specific expression of 


“the internal clock of a diurnal 
primate...has implications for 
understanding human biological 
rhythms and chronotherapy...” 


core circadian clock genes more apparent 
than between the expression of two tran- 
scriptional activators that form a heterodi- 
mer: BMALI (brain and muscle ARNT-like 
1) and CLOCK (circadian locomotor output 
cycles protein kaput). The BMALI gene was 
rhythmically expressed in more than 40 sam- 
pled tissues, whereas CLOCK was rhythmic in 
less than 10 tissues (mostly overlapping with 
BMALI). In addition, the phase of CLOCK 
expression varied wildly over 12 hours, but 
the phase of BMALI expression was tightly 
limited to a 4-hour period. If BMALI1 and 
CLOCK are like copilots driving circadian 
gene expression, then it seems BMALI is 
mostly at the wheel. The gene that was rhyth- 
mically expressed in the greatest number of 
tissues encodes a relative newcomer to the 
circadian field: CHRONO (ChIP-derived re- 
pressor of network oscillator), which coun- 
teracts BMALI1 and CLOCK transcriptional 
activation in a manner similar to but inde- 
pendent of the cryptochrome paralogs and 
down-regulates anti-inflammatory responses 
through glucocorticoids (8, 9). The wide- 
spread rhythmicity of CHRONO expression 
suggests a more prominent role for circadian 
regulation of glucocorticoid signaling in diur- 
nal organisms. 

Is a diurnal animal simply a nocturnal one, 
with the gene expression phases flipped by 12 
hours? When Mure et al. compared wild olive 
baboon transcriptional oscillations of a few 
core circadian clock components with those 
of nocturnal mice, the phases were indeed 
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shifted by 12 hours in most tissues, with one 
notable exception. Phases in gene expression 
between mice and wild olive baboons were 
identical in a specialized brain structure 
called the suprachiasmatic nucleus (SCN). 
The SCN is called the central pacemaker be- 
cause removal of the SCN abolishes an organ- 
ism’s circadian rhythms (10, 11). Importantly, 
the SCN responds to light via light-sensitive 
cells in the retina, so it makes sense to as- 
sume that light synchronizes transcriptional 
oscillations in both nocturnal and diurnal 
animals by means of a similar mechanism. 
However, this finding means that nocturnal 
and diurnal behavior is regulated down- 
stream of the SCN and is perhaps coordinated 
by individual organs and tissues according to 
their particular metabolic requirements. For 
example, the phases of rhythmic transcripts 
between mouse and baboon in the 
heart, which presumably has similar 
metabolic requirements during night 
and day, were more closely aligned 
than those in the cerebellum, where 
sleep and wake may have a stronger 
impact on metabolic load. 

Mure et al. found that rhythmic 
gene expression was grouped into 
two phases—early afternoon and just 
before dawn—with a marked absence of 
expression around midnight. This biphasic 
circadian rhythm is reminiscent of fruit- 
fly behavior but differs from the nocturnal 
mouse, in which phases of gene expression 
are spread evenly throughout the day (7, 12). 
Unfortunately, Mure et al. did not determine 
how rhythmic gene expression was influ- 
enced by light, food, or temperature, which 
may be the subject of future work. For exam- 
ple, nocturnal rodents can be coerced into 
diurnal activity by restricting food access 
(13) or by increasing nighttime workload for 
food (14). Nevertheless, the data add an im- 
portant reference atlas for a daytime-active 
organism—the closest to humans and most 
complete yet—and reiterate that biological 
rhythms are not soloed by an SCN overlord 
but rather arise from a plucky band of tis- 
sue-specific mechanisms. 
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Circadian organization of the genome 


The clock protein Rev-erba regulates genome folding to establish circadian gene repression 


By Carolina Diettrich Mallet de Lima 
and Anita G6nd6r 


henotypic plasticity, the potential for 
phenotypic change in response to ex- 
ternal signals, drives adaptation to en- 
vironmental fluctuations and requires 
flexible gene regulation (7). A seminal 
example of adaptive plasticity is repre- 
sented by the circadian clock (2), which es- 
tablishes 24-hour rhythmicity in physiology, 
metabolic activities, and behavior. As external 
time cues, such as light and food intake, can 
reset the phase of oscillations, circadian ho- 
meostasis enables light-sensitive organisms 
to both anticipate and adapt to daily environ- 
mental cycles (2). On page 1274 of this issue, 
Kim et al. (3) provide a glimpse into 
the genome-wide complexity of tran- 


tacts, the mechanisms and dynamics of TAD 
formation are intensely investigated (9). 

Kim e¢ al. have revealed a surprising plas- 
ticity in genome-wide chromatin folding dur- 
ing the circadian cycle and documented its 
role in oscillating transcription. They gen- 
erated three-dimensional contact maps be- 
tween distant regions in living cells at 5 a.m. 
and 5 p.m. to determine changes in genome 
organization. Consistent with the previously 
reported cell type-independence of most TAD 
boundaries, at the two time points examined, 
Kim et al. found stable TAD-boundary loca- 
tions which correlated with the binding of 
the chromatin architectural proteins CCCTC- 
binding factor (CTCF) and cohesin, factors 
known to regulate TAD-boundary forma- 


clock function and is fine-tuned by the initial 
combinatorial binding of transcriptional acti- 
vators and repressors to CryJ enhancers, fol- 
lowed by its derepression upon the removal 
of its major transcriptional repressor, Rev- 
erba (0). Analysis of chromatin interactions 
complemented with mapping of Rev-erba 
binding demonstrated that CryI E-P contacts 
were circadian, in phase with Cry expres- 
sion, and antiphase to Rev-erba binding. 
Accordingly, genetic depletion of Rev-erba 
attenuated the amplitude of the frequency of 
E-P contacts, whereas in vivo overexpression 
of Rev-erba reduced the frequency of E-P 
loops and CryI expression. The mechanism 
of Rev-erba action involved the recruitment 
of nuclear receptor corepressor (NCoR) com- 
plex, which has histone deacetylase 
(HDAC) activity, followed by the 


scriptional plasticity during the phys- 
iological circadian cycle in mice, with 
implications for our understanding 
of diseases linked with deregulation 
of the circadian clock (2). 

Chromatin provides a platform for 
the integration and propagation of 
signaling and metabolic cues to influ- 
ence gene expression (4-6). How com- 
binations of chromatin modifications 
and transcription factors fine-tune 
the degree of transcriptional plas- 
ticity is central to our understand- 
ing of differentiation and diseases 
with epigenomic deregulation, such 
as cancer (7). Circadian clock-regu- 
lated plasticity in chromatin states 
at loci that are transcribed in a cir- 
cadian manner is well-documented 
and correlates with gene expression 
(5, 8). Chromatin, however, possesses 
regulatory information beyond his- 
tone and DNA modifications by in- 
fluencing how regulatory elements, 
such as enhancers and promoters, 
communicate with each other over 


Circadian plasticity of chromatin folding 
Enhancer-promoter (E-P) interactions in circadian sub-TADs are under 
circadian control and antagonized by Rev-erba. Factors regulating 
E-P loops that act as metabolic sensors might, in general, mediate cross- 
talk between metabolism and genome folding. 


t— Sub-TAD 
Enhancer —>) <—— Promoter 


Circadian gene 


5 a.m. 


NCoR 
HDAC3 
Metabolism : 
BRDA Substrate 
MEDI | + Cofactor 
? a: |. «Ligand 
Sp.m Cae ree regulating 
E-P loops 
NCoR 
HDAC3 


CTCF-cohesin-————€ 
BRD4 
MED1 


eviction of bromodomain-containing 
protein 4 (BRD4) and its partner me- 
diator complex subunit 1 (MED1)— 
factors implicated in loop formation 
(9) (see the figure). 

Interestingly, E-P loops that are 
present at 5 a.m. and involve sites 
that bind Rev-erba at 5 p.m. could be 
classified either as “engaged” (gene 
transcription is repressed at 5 p.m. 
upon Rev-erba binding) or “passive” 
(not repressed at 5 p.m. despite Rev- 
erba binding). The observation that 
Rev-erba binding was _ associated 
with rhythmic change in enhancer 
and gene activity only at engaged 
sites raises the questions of what 
functions the passive sites might 
have and how the effects of Rev-erba 
are buffered against at these sites, at 
the time points examined. Interest- 
ingly, Rev-erba can repress its tar- 
get genes through either direct DNA 
binding or indirect binding, tethered 
by cell type-specific transcription fac- 
tors to orchestrate cell type-specific 


large genomic distances (9). Maps of physical 
contact probabilities between distant regions 
have earlier revealed that the genome is orga- 
nized into topologically associating domains 
(TADs) displaying high local, intradomain 
chromatin-fiber contact frequencies. Given 
that TADs constrain and thereby increase the 
specificity of enhancer-promoter (E-P) con- 
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tion (9). In addition to stable TADs, the au- 
thors identified extensive remodeling of E-P 
interactions within hundreds of sub-TADs 
(smaller units within TADs with high interac- 
tion frequencies) containing circadian genes 
with expression peaks at 5 a.m. or 5 p.m. 

To identify the mechanism and relevance 
of circadian fluctuations in E-P contacts, they 
focused on cryptochrome 1 (CRY1), a major 
repressor in the core clockwork (J0). The 
phase of CryI gene expression is critical for 
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transcriptional rhythms (17). Unlike engaged 
sites, passive sites were not enriched in direct 
Rev-erba binding motifs. It might therefore 
be interesting to explore how local regulatory 
elements and the various mediators of indi- 
rect Rev-erba binding at passive sites might 
influence Rev-erba function(s). They might 
thus affect the recruitment of corepressors 
or alter the localization of target regions in 
the compartmentalized nucleus, where active 
and repressive nuclear environments tend to 
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be spatially separated, to potentially reduce 
transcriptional noise (1). 

Kim e¢ al. have uncovered a new principle 
of circadian gene repression by Rev-erba that 
involves the dynamic reorganization of ge- 
nome folding at circadian loci. Previously, it 
was difficult to predict to what extent genome 
organization undergoes circadian oscillations. 
Although these experiments suggest that 
circadian chromatin plasticity is restricted 
to sub-TADs enriched in circadian genes, it 
cannot be ruled out that rhythmic changes in 
chromatin folding do not leave nearby non- 
circadian genes unaffected. Given the role of 
transcriptional noise in cell-state transitions 
and phenotypic heterogeneity driving tumor 
evolution (7), it will be important to decipher 
any potential cross-talk between the clock- 
work and the machinery that regulates sto- 
chastic transcriptional fluctuations (7). 

The findings of Kim et al. also support the 
emerging theme that metabolic states might 
directly affect not only chromatin marks 
(4, 12) but also genome folding via “meta- 
bolic sensors” with functions in genome 
organization. Rev-erba binds to heme (13), 
which is synthesized in a circadian- and 
Rev-erba-dependent manner (/4). Interest- 
ingly, heme facilitates Rev-erba binding to 
the NCoR complex and its repressor function 
at metabolic genes (13), suggesting potential 
cross-talk between circadian genome folding 
and energy homeostasis. Another example 
is poly(ADP-ribose) polymerase 1 (PARP), 
which senses the amount of oxidized nicotin- 
amide adenine dinucleotide (NAD*) (4) and 
regulates, with CTCF, the rhythmic mobility 
of circadian genes between transcriptionally 
permissive and repressive nuclear compart- 
ments (15). As E-P contacts and their sub- 
nuclear localization are dynamic, E-P loop 
formation and chromatin mobility might be 
among the first responders to metabolic cues, 
for example, in response to diet. Further de- 
ciphering the cross-talk between metabolic 
sensors and genome organization will likely 
shed new light on diseases with deregulated 
phenotypic plasticity, such as cancer (7). & 
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A landscape of disgust 


Parasite avoidance behavior affects ecology and evolution in 
ways similar to predator avoidance 


By Sara B. Weinstein,' Julia C. Buck,” 
Hillary S. Young” 


rancid meal, a moist handshake, a 
pile of feces: These phenomena elicit 
disgust and avoidance that protect 
humans from our most pervasive con- 
sumer—infectious agents. This avoid- 
ance is not specific to humans. Various 
animals alter their behavior to avoid infec- 
tion (J). For instance, Poirotte et al. recently 
showed that mandrills avoid parasite-con- 
taminated feces and refrain from grooming 
infected individuals (2). These primates’ nu- 
anced ability to detect and alter their behav- 
ior in response to differential exposure risk 


suggests close parallels to the “landscape of 
fear” elicited by predators (see the figure), 
with perceived peaks and valleys driven by 
parasite abundance and exposure risk. 
Owing to the high cumulative fitness costs 
of parasitism over an individual’s lifetime, 
there will be strong selection for parasite 
avoidance. Yet, because most parasites are 
difficult to detect, avoidance often relies on 
indirect cues that are driven by long-term 
associations. Many species avoid feces and 
carcasses, regardless of infection status (J). 
Infection can also alter the chemical compo- 
sition of sweat, breath, and feces, allowing 
more targeted avoidance of infected indi- 
viduals (3). For instance, mandrills identify 


Overlapping landscapes of fear and disgust 

Animals detect predation threats, creating a three-dimensional fear landscape with mountains of risk and 
valleys of safety. Animals can also detect infection threats, which form a landscape of disgust. The resulting 
overlapping landscapes have far-reaching ecological and evolutionary consequences. 


Animals avoid 
contaminated 
food and water. 


Animals flee 
from parasitic 
and biting flies. 


Grouping protects 
against predators, 
but increases 
parasitism. 
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Avoiding predator — Grazers avoid Animals avoid 
feces protects forgaing near foraging where 
against predators carcasses. ticks are 

and parasites. questing. 


Perceived predation risk 
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and avoid infected conspecifics on the basis 
of olfactory cues associated with the altered 
chemical composition of infected feces (2). 
Thus, whereas prey “fear” predators, parasite 
avoidance is more akin to disgust—an innate 
aversion to cues associated with parasites (1). 
As with predator avoidance, parasite avoid- 
ance incurs costs such as altered social inter- 
actions, reduced reproductive opportunities, 
diminished foraging, and increased energy 
expenditure (4). To predict when parasite 
avoidance is most likely to occur and under- 
stand its consequences, it must be integrated 
into the same trade-off framework that is 
widely applied to predator avoidance. 
Beyond its direct effects on hosts, parasite 
avoidance has cascading effects at a range of 
scales and in various systems (5). Ants and 
their parasitoids provide a useful model 
system for studying the cascading effects of 
parasite avoidance on herbivory. Azteca ants 
experimentally exposed to parasitoid flies 
dramatically reduce their activity levels to 
avoid infection. This reduces the ants’ ability 
to attack their herbivorous prey, the coffee 
berry borer, and allows the latter to consume 
more coffee berries (6). Such ecological im- 
pacts of parasite avoidance are not limited 
to top-down effects on vegetation; they also 
alter mutualism, competition, and predation. 
Like the predator-induced landscape of 
fear, a parasite-induced landscape of dis- 
gust (see the figure) has far-reaching con- 
sequences for population dynamics, species 
interactions, and ecosystem structure. How- 
ever, whereas direct effects of predators are 
naturally restricted to prey species (7), a par- 
asite-induced landscape of disgust extends to 
all animals, including very large species and 
top predators that are typically immune to 
predation (8). For example, a recent synthesis 
by Mole6n et al. showed that carnivores avoid 
foraging on other carnivore carcasses and 
rarely engage in cannibalism (9). Feeding on 
conspecifics increases disease transmission, 
and forgoing this food resource reduces expo- 
sure to shared parasites (10). At a landscape 
level, this avoidance behavior dramatically 
alters the amount of carcass material that 
is available in an ecosystem, with important 
downstream effects. For instance, carcasses 
not consumed by mammalian scavengers 
may support more diverse and abundant 
arthropod communities (9). Furthermore, 
when carcasses decompose, their nutrient 
inputs increase plant growth (12). Like the 
ecosystem-level impacts of predator avoid- 
ance, the ecosystem-level impacts of parasite 
avoidance are likely to be substantial. 
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Parasite and predator avoidance can exert 
opposing pressures on host and prey species. 
For example, tadpoles increase their activity 
levels to protect themselves from trematode 
infection, but this increases their susceptibil- 
ity to visual predators (72). The tadpoles fear 
and avoid both predators and parasites, but 
do not perceive these threats as equal. When 
experimentally forced to choose between pre- 
dation and parasitism risk, tadpoles opt for 
infection (73). This makes sense because pre- 
dation has a larger short-term fitness effect 
than does parasitism, and the optimal behav- 
ior avoids the greatest immediate threat. 

However, parasite and predator avoidance 
are not always mutually exclusive. For in- 
stance, avoiding predator feces protects prey 
against both predators and their parasites 
(7/4), and, when animals group together, they 
dilute their risk of attack by both predators 
and some parasites (for example, ticks, bot- 
flies) (15). Although predator avoidance has 
received more attention, parasite avoidance 
also has strong effects on foraging, move- 
ment, and social interactions (1). 

Parasite avoidance is likely to have impor- 
tant consequences. For example, grazing ani- 
mals such as cows, sheep, and horses avoid 
fecal contaminated forage (4), and changing 
livestock practices to recognize the needs and 
capabilities of animals to avoid parasites will 
reduce the cost associated with avoidable 
livestock diseases. Furthermore, conservation 
planning and projections may benefit from 
knowledge of how shrinking and fragmented 
habitat limits the ability of animals to avoid 
risky sites and conspecifics. As populations of 
large predators continue to decline, parasite 
avoidance may become a relatively stronger 
driver of ecological processes. Integrating 
parasites into the ecology of fear is thus cru- 
cial to understanding the general ecological 
and evolutionary effects of parasites on eco- 
systems and predicting how ecosystems are 
likely to respond to the rapidly changing en- 
vironments of the Anthropocene. 
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PSYCHOLOGY 


Logic in babies 
12-month-olds spontaneously 


reason using 
process of elimination 


By Justin Halberda 


he success of science rests on our hu- 

man ability to reason logically. But 

where does this ability come from? 

Is it an inherent attribute of mind 

that even a young child might pos- 

sess, or is it a hard-won accomplish- 
ment mastered later in life? On page 1263 
of this issue, Cesana-Arlotti et al. (7) reveal 
that one essential form of logical inference, 
process of elimination, is within the toolkit 
of 12-month-old infants. This reveals the 
earliest known foundations of our human 
ability to reason logically. The race to doc- 
ument the range of early logical abilities 
shared by infants, adults, and nonhuman 
animals, and to determine how these foun- 
dational abilities empower our broader ca- 
pacities to reason, has begun. 

Every scientific method requires a sup- 
porting logic. For Francis Bacon (2), this 
was unfettered empirical observation fol- 
lowed by induction (reasoning from many 
cases to form a general principle). Karl 
Popper (3) stressed the importance of hy- 
pothesis testing and the ability to refute 
hypotheses found to be false (science as 
an extended instance of process of elimi- 
nation). And Thomas Kuhn (4) highlighted 
the dramatic changes that occur during 
scientific revolutions, in which wholly new 
models of phenomena are created through 
model building and abduction (sometimes 
called “inference to the best explanation”). 
Examples of such revolutions are the Co- 
pernican Revolution (5) producing the 
heliocentric model of the solar system, 
and the Einsteinian Revolution of special 
relativity in which space and time become 
one (6). In each of these cases (induction, 
hypothesis testing, abduction), the work 
of science is supported by an underlying 
logic. No logic, no science. 

Where does our human faculty to reason 
logically come from? At the end of the pre- 
vious century, Susan Carey suggested that 
such logic might be deep in our develop- 
mental roots: A young child might reason 
using intuitive theories, models, and abduc- 
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tion, much like a professional scientist (7). 
Other developmental psychologists pushed 
this further and suggested that even infants 
might be understood to be little scientists 
drawing inferences in the cradle about ob- 
jects and causes (8). But suggestions like 
these are frequently met with incredulity 
(9): After all, it often feels like logical rea- 
soning is effortful, conscious, and even lin- 
guistically based (10). These characteristics, 
if accurate, would seem to preclude the pos- 
sibility that preverbal infants could engage 
in any such process. 

Cesana-Arlotti et al. asked whether pre- 
linguistic 12- and 19-month-old infants 
would spontaneously reason using process 
of elimination. This is a form of inference 
also known as disjunctive syllogism or mo- 
dus tollendo ponens—it is any argument of 
the form: A or B, not A, therefore B. Cesana- 
Arlotti et al. relied on one of the few behav- 
iors babies voluntarily engage in—looking 
at whatever they find most interesting (77). 
They measured infants’ looking at comput- 
erized vignettes in which two different ob- 
jects (A and B) were shown being hidden 
behind a wall. Infants watched as a cup 
scooped one of the objects from behind 
the wall, and then came to rest next to the 
wall—critically, only the topmost edge of 
the contained object could be seen peeking 
out of the cup, such that infants could not 
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An infant at play 
is also an infant 
reasoning logically. 


tell for sure whether the object was A or B. 
At this moment, infants could have formed 
a disjunctive thought—for example, “either 
the object in the cup is object A or it is ob- 
ject B”” Next, this ambiguity was resolved: 
The wall dropped to reveal that object A was 
behind the wall, but the contents of the cup 
remained hidden. This is the moment of po- 
tential elimination, and an opportunity for 
infants to draw a key inference—“because 
object A is not in the cup, object B must be 
in the cup.” Finally, infants’ expectations for 
the cup’s contents were tested: Either the 
expected object (object B) or, surprisingly, 
another object A emerged from the cup. 
Infants looked longer at the surprising out- 
come—an indication that their expectations 
were violated and a hint that they were 
seeking further information to resolve the 
conflict (72). 

Potentially even more exciting, infants 
also showed signs of making the necessary 
inferences along the way—for example, 
upon seeing which of the two objects was 
behind the wall, but before the cup’s con- 
tents were revealed, infants’ pupils dilated 
and they tended to shift their fixation to 
the cup (consistent with them inferring 
which object must be inside). This pattern 
suggests that infants used the information 
they had seen to reason through a disjunc- 
tive syllogism (A or B, not A, therefore 
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B). There were also additional versions 
of these vignettes that manipulated the 
precise sequence of hiding and revealing, 
which allowed the authors to determine 
what specifically the infants were remem- 
bering and expecting during each moment. 

The careful crafting of stimuli and clever 
analyses of infants’ spontaneous looking 
behavior by Cesana-Arlotti et al. show us 
that infants have the capacity to reason 
by process of elimination. By contrast, 
whereas nonhuman animals such as dogs 
facing similar situations of ambiguity may 
ultimately form the right conclusion, they 
appear to arrive at this hunch using an as- 
sociative rather than logical process (13). 
Unwinding the similarities and differences 
among infant, child, adult, and animal rea- 
soning abilities is an important task for 
the future, as is determining why, despite 
being deployed by infants, reasoning feels 
effortful and why humans often “go with 
our gut” and rely on heuristics and biases 
rather than logical inference (14). 

A new field, studying the foundations of 
logical abilities, is emerging. As with other 
successes of developmental psychology 
in recent decades [as in work on the psy- 
chology of number, language, and theory 
of mind (75)], this new field will make use 
of behavior and brain studies with infants, 
children, and adults; studies of nonhuman 
animal abilities; studies of the correlations 
between reasoning and other abilities; and 
training studies to determine where we 
all begin and how maturation and experi- 
ence build upon these foundations. It is a 
thrilling time for us as scientists—using 
logical reasoning to understand how we 
reason logically. m 
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POLYMERS 


Random copolymers that protect proteins 


Synthetic polymers are designed to stabilize proteins in polar and nonpolar solvents 


By Alfredo Alexander-Katz! and 
Reid C. Van Lehn? 


cientists have tried and in some 

limited cases succeeded to harness 

proteins to do chemistry (J) or use 

them in functional materials. How- 

ever, most proteins only function 

correctly if they fold into specific 
conformations, which typically occurs with 
the assistance of other proteins (such as 
chaperones, translocons, or transporters) 
that mediate structure formation, mem- 
brane insertion, and intracellular traffick- 
ing (2, 3). Several methods have been used 
to improve protein stability in 
nonbiological environments— 
including micelle encapsula- 
tion, polymer conjugation, 
and sol-gel trapping (4)—but 
for most intended applica- 
tions, they suffer from low 
levels of functionality, diffi- 
cult chemical postfunctional- 
ization, or the requirement of 
very specific solvent environ- 
ments. On page 1239 of this 
issue, Panganiban et al. (5) in- 
troduce an approach for sta- 
bilizing proteins in disparate 
solvent environments that 
does not suffer from these 
drawbacks. 

The surfaces of folded pro- 
teins are chemically diverse, 
with small nonpolar, polar, 
and charged “patches” of the 
surface in close proximity. By 
characterizing the distribu- 
tion of patch size and sepa- 
ration for several exemplary 
proteins, Panganiban eft al. 
developed a statistical rep- 
resentation of the chemical 
diversity expected of protein 
surfaces. They then synthe- 
sized random heteropolymers 
(RHPs) composed of four 
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monomers with chemical properties de- 
signed to interact with the chemical spe- 
cies present on protein surfaces. Critically, 
the RHPs mimicked the statistical distribu- 
tion of protein surface patches rather than 
the specific surface properties of a protein 
of interest (see the figure). 

Using this design strategy, they iden- 
tified a RHP composition that strongly 
binds to protein surfaces in organic sol- 
vents yet weakly binds in water. Molecu- 
lar simulations at multiple length scales 
demonstrated that the RHP adopts local 
conformations to maximize favorable in- 
teractions with the protein surface, le- 


A coat that fits off the rack 


A statistical analysis of protein surfaces enabled Panganiban et al. to design random 
copolymers as coatings. These polymers stabilize proteins in a wide variety of solvents. 


The statistical distribution of chemically diverse 
patches (shown in different colors) on protein 
surfaces enabled the design of synthetic 
polymers that can bind to their surfaces. 


Theory 


Probability 


Patch size 
© Hydrophilic 
© Hydrophobic 


@ Cationic 
@ Anionic 


The polymer coating stabilizes proteins in polar 


(hydrophilic or aqueous) or nonpolar (hydrophobic 


or organic) environments. 
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Synthetic random 
copolymer 


Hydrophobic | 


veraging the polymer’s conformational 
flexibility and the complementary chemi- 
cal functionalities engineered into the 
polymer backbone. Experimentally, this 
interaction leads to astonishing protein 
folding and stabilization in a variety of 
non-native environments compared with 
alternative methods. The authors show 
that the same RHP composition increases 
membrane protein folding in water and 
increases water-soluble protein activity in 
organic solvent by an order of magnitude 
(or more). These results demonstrate that 
RHPs can act like omnifunctional chaper- 
ones that can stabilize both lipophilic pro- 
teins in aqueous solution and 
hydrophilic proteins in or- 
ganic solvents, without com- 
promising protein function. 
Researchers had previously 
proposed (and computation- 
ally proved) that statistically 
controlled random copolymers 
(such as unfolded proteins) 
could recognize “random” pat- 
terns on surfaces (6, 7). Pan- 
ganiban et al. validated this 
conjecture even for the curved 
surfaces of proteins, a remark- 
able result given the complex- 
ity of protein interfaces. This 
work provides a blueprint for 
matching RHPs to protein sur- 
faces, further blurring the line 
between the biological and the 
synthetic world. It also raises 
new questions. How many 
monomer chemistries are nec- 
essary to bind a given protein 
surface? Can other chemistries 
be developed along these lines? 
How do the physical properties 
of proteins, such as curvature, 
affect the necessary monomer 
statistical distribution? 
Expanding on the strategy 
used in this work, it may be 
possible to elucidate the rules 
used by biology to ensure pro- 
tein function by analyzing the 
properties of effective RHPs. 
For example, a key feature of 
the RHPs is their conforma- 
tional flexibility. Intriguingly, 
large conformational fluctua- 
tions and transient unfolding 
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are necessary for the function of water- 
soluble and membrane protein chaperones, 
suggesting that conformational flexibility 
may be a general feature of chaperones 
(8, 9). Comparing the chemical features 
of effective RHPs with the sequences of 
similarly flexible intrinsically disordered 
proteins may further enable the identifi- 
cation of proteins with chaperone activity. 
Optimizing RHP compositions based on the 
chemical diversity found in separate classes 
of proteins could also reveal the minimal 
chemical features necessary for chaperone 
action. Last, analyzing the thermodynam- 
ics underlying the insertion of RHP-bound 
membrane proteins into a lipid bilayer, 
possibly aided by simulation techniques 
such as those used in this work, could re- 
solve long-standing questions regarding the 
forces driving membrane protein folding 
and assembly (10). 

The demonstration by Panganiban et 
al. that RHPs can stabilize proteins in dif- 
ferent media without compromising their 
function should open the door for the 
generation of new technologies as well as 
scientific findings. This result could en- 
able the use of many proteins for diverse 
chemical processes, but the prospects of 


(17 


.../random heteropolymers] 
can act like omnifunctional 
chaperones that can 
stabilize both lipophilic... 
and hydrophilic proteins...” 


this work extend beyond chemistry. For 
example, one can envision stabilizing pro- 
teins in healthy fats in order to solve food 
problems in regions where refrigeration is 
not available, or using biodegradable ran- 
dom copolymers to neutralize viruses or 
toxic agents to prevent infection or biologi- 
cal attacks. 
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CANCER THERAPY 


Beyond PARP-POL® 
as an anticancer target 


Targeting cancers dependent on DNA polymerase @ 
has considerable clinical potential 


By Geoff S. Higgins! and Simon J. Boulton?* 


ince the discovery that DNA is un- 

stable and prone to decay, targeting 

DNA repair deficiencies has become 

a proven and effective strategy in the 

fight against cancer. Over the past 

decades, it has become increasingly 
apparent that selective loss of DNA repair 
pathways is an early and frequent event 
in tumorigenesis, occurring in 40 to 50% 
of many cancer types. Loss of DNA repair 
likely provides a selective growth advantage 
to tumor cells as this results in genetic in- 
stability and/or enhanced mutation rates, 
which can drive tumor evolution (7). How- 
ever, DNA repair-deficient cancers often be- 
come critically dependent on backup DNA 
repair pathways, which present an “Achil- 
les heel” that can be targeted to eliminate 
cancer cells. This is the basis of synthetic 
lethality and is exemplified by the success of 
poly(ADP-ribose) polymerase (PARP) inhibi- 
tors in treating BRCA-deficient breast and 
ovarian cancers. However, new therapeutic 
strategies are urgently needed to overcome 
acquired and innate PARP inhibitor (PARPi) 
resistance and to exploit other DNA repair 
deficiencies. Optimism is increasing that tar- 
geting DNA polymerase 9 (POL®@), which is 
involved in DNA repair, will not only syner- 
gize with PARPi’s but may have broader util- 
ity in cancer treatment. 

The realization that tumor-specific altera- 
tions in DNA repair pathways could be ex- 
ploited therapeutically was first established 
in 2005 (2, 3). Inactivating BRCA muta- 
tions, which account for ~10% of hereditary 
breast and ovarian cancers, are defective 
for homologous recombination (HR), a key 
pathway of DNA double-strand break (DSB) 
repair. BRCA-deficient cancer cells, but not 
the wild-type controls, were found to be ex- 
quisitely sensitive to inhibitors of the single- 
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strand break repair enzyme PARP owing to 
the inability to deal with the lesions induced 
by PARPi in the cancer cells (2, 3). Recent 
studies have established that PARPi’s act by 
trapping PARP on DNA. This stalls replica- 
tion forks, which require HR for stabilization 
and replication restart (4). Hence, cells defi- 
cient in HR (for example, with BRCA muta- 
tions) are unable to respond to trapping of 
PARP on DNA, leading to cell death. 

Currently, there are five PARP inhibitors 
in clinical trials, which have clear benefit 
in some patients with HR-deficient (HRD) 
cancers. BRCA mutations represent the most 
widely studied HRD subtype and have previ- 
ously been shown to be associated with the 
greatest progression-free survival benefits 
from PARPi therapy (5). Unfortunately, of 
the HRD cancers that do respond to PARPi’s, 
ongoing studies suggest that these tumors 
ultimately progress because of acquired 
PARPi resistance. It is also clear that PARPi’s 
do not work in all DNA repair-deficient tu- 
mors, such as gastric cancers defective for 
DNA damage signaling by ataxia-telangiec- 
tasia mutated (ATM) (6), nor do they target 
increased spontaneous DNA damage caused 
by DNA replication stress, which is a hall- 
mark of many tumors. POL@ has emerged as 
a promising target for the treatment of vari- 
ous DNA repair-deficient cancers, including 
those that are HRD. Evidence suggests that 
POL# inhibitors could synergize with PARPi’s 
(Z 8), as they act by a different mechanism, 
and may also provide a means to selectively 
sensitize tumors to radiotherapy (9). 

DSBs are predominantly repaired by the 
nonhomologous end joining (NHEJ) DNA 
repair pathway in G, phase of the cell cycle 
and by HR in S-G, phases. Microhomology- 
mediated end joining (MMEJ) is a “backup” 
DSB repair pathway in the event that NHEJ 
or HR are compromised (see the figure). 
HR and MMEJ share the same substrate, 
namely, a resected DSB that contains a 3’ 
single-stranded DNA (ssDNA) overhang 
bound by replication protein A (RPA). 
Hence, if HR fails or is deficient, then MMEJ 
is the favored option to repair resected DSBs. 
POL# possesses both DNA polymerase and 
helicase domains, which are important for 
MMEJ in vivo (10). Biochemical evidence 
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suggests that the helicase domain displaces 
RPA at resected DSBs to facilitate annealing 
of ssDNA containing microhomologies (short 
segments of complementary DNA sequence). 
The polymerase activity then completes DNA 
synthesis to fill in the resected gap before li- 
gation completes the MMEJ repair reaction 
(10). MMEJ is error-prone as it often results 
in loss of genetic information at the 
break site and can promote repair 
between DSBs from different parts 
of the genome, leading to transloca- 
tions and other complex rearrange- 
ments. Although other proteins such 
as RPA and CtBP-interacting protein 
(CtIP) are important for MMEJ, their 
roles in DNA damage recognition 
and in other DNA repair processes 
arguably make them less attractive 


DSBs occurring when chromosomes 
are duplicated (S and G, phase) 
means that homologous sequences 
are present for error-free repair. 


proteins such as PARP! or p53-binding pro- 
tein 1 (53BP1) (/4). It is possible that POLO 
inhibitors would induce tumor response in 
these instances of PARPi resistance. Tumors 
may eventually develop resistance to POLO 
inhibitors, although the mechanisms likely 
to be involved are not yet clear. It is, how- 
ever, anticipated that a primary mechanism 


DSB repair by HR and MMEJ 


In HRD cancer cells, MMEJ becomes active, making POL@ a cancer- 
specific therapeutic target that induces synthetic lethality in HRD tumors. 


HR MMEJ 


MME) is error-prone as it results 
in loss of genetic information at 
the DSB site and can repair DSBs 
originating from different parts 


apy in HR-proficient tumor cells at a level 
that would be expected to have considerable 
clinical benefit (9). Radiotherapy is widely 
used in the curative management of cancer 
patients, either as an adjunct to surgery, or 
as definitive treatment alone or in combi- 
nation with cytotoxic chemotherapy. POL# 
is overexpressed in several common cancers 
that are routinely treated with ra- 
diotherapy, such as lung and rectal 
cancer (11). Despite developments 
in the precision and accuracy of 
radiotherapy, the unavoidable ir- 
radiation of healthy tissues sur- 
rounding tumors can induce side 
effects that limit the dose of radia- 
tion that can be delivered and thus 
the tumor cell killing that can be 
induced. The marked disparity in 


of the genome (leading to trans- . 

targets for drug development be- STETOTETSITEFATETEN ACE: #UGITTOITITSITHOITD locations and chromosome POL@ expression between tumor 
cause of their lack of specificity. rearrangements). cells and most normal tissues has 

One of the principle reasons why =RPA—®®& CtIP-dependent raised the exciting prospect that 
POLO has become a major focus for resection of DSB specific POLO inhibitors, delivered 
drug discovery is that its expression SUNN, \))))))\ Toe Srnsnnia, errr concurrently with radiotherapy, 
is largely absent in normal cells but a AM i a ee could improve tumor control with- 
is increased in many cancers, both RADS1—9 BRCAI, BRCA2, RPA out exacerbating the associated 
with and without HRD (7). Indeed, oe ‘ and others a displacement side effects of radiotherapy. 


POLO overexpression in multiple 
HR-proficient tumor types such as 
lung, gastric, and colorectal cancer, 
is associated with adverse clinical 
outcomes (11). Critically, tumor cells 
harboring DSB repair defects de- 
velop a pronounced dependency on 
MMEJ to the extent that synthetic 
lethality has been shown to occur 
when POL is disrupted in cells defi- 
cient in either HR, NHEJ, or ATM (7, 
8, 12). The reasons why POL# is up- 
regulated and associated with poor 
outcomes in many tumors is not well under- 
stood, but one possibility is that POLO repairs 
spontaneous DNA damage present in cancer 
cells and therefore affords them a growth ad- 
vantage (73). Alternatively, error-prone DNA 
repair associated with POL# activity could 
also contribute to greater genomic instability 
and lead tumors to develop greater genetic 
diversity and evolutionary plasticity, which 
can drive resistance to cytotoxic therapies. 
Despite the substantial clinical benefits 
observed with PARPi monotherapy in BRCA- 
mutated cancers, treatment resistance is a 
common occurrence, and POLO inhibition 
might be beneficial in treating those tumors 
that progress on PARPi’s. PARPi resistance 
can arise from a variety of genetic altera- 
tions, including those that reactivate HR. 
Although POL# inhibition would not be 
expected to cause tumor response in cases 
where secondary reversion BRCA mutations 
cause restoration of HR activity, over half 
of recurrences arise from other changes, in- 
cluding decreased expression of DNA repair 
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would be secondary BRCA mutations caus- 
ing reinstatement of HR function. 

Because of their distinct mechanisms of 
action, combined PARPi and POL# inhibitor 
therapy might be expected to have a syner- 
gistic effect; simultaneously driving DSB 
formation while inhibiting DSB repair. Dual 
POL# and PARP inhibition might therefore 
lead to more prolonged, clinically meaningful 
responses and possibly delay the emergence 
of resistant disease compared with either 
therapy alone. In vitro data support this op- 
timism, because PARPi treatment, combined 
with reduced POLO expression, has been 
shown to synergistically reduce colony for- 
mation in BRCAI-deficient cells (7). The dra- 
matic tumor growth delay seen in xenograft 
studies, whereby human HRD tumor cells are 
subcutaneously engrafted in mice, also points 
to a synergistic effect of POL9 and PARP defi- 
ciency in tumors with impaired HR (7). 

In addition to the synthetic lethal effects, 
disruption of POL# has also been shown to 
induce sensitization of tumors to radiother- 
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The clinical potential for POLO 
inhibition to prolong the survival 
of patients with BRCA-mutated tu- 
mors, or HRD tumors in general, 
and to improve the cure rates of 
patients receiving radiotherapy are 
substantial. It is hoped that the abil- 
ity to define those patients likely to 
benefit from POLO inhibitor treat- 
ment should enable this emerging 
class of anticancer therapy to be 
rapidly translated into the clinic. 
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EDUCATION 


Societal inequalities amplify 
gender gaps in math 


Egalitarian countries cultivate high-performing girls 


By Thomas Breda,!? Elyés Jouini,?? 
Clotilde Napp”” 


hile gender gaps in average math 
performance are now close to zero 

in developed countries, women 

are still strongly underrepre- 

sented among math high perform- 

ers (1). This gender gap contributes 

to the underrepresentation of women in 
math and science in higher education and 
to their subsequent worse position in the la- 
bor market (2, 3). With the roles of nature 
and nurture (4—6) on gender performance 
gaps having been debated for more than a 
century, research in the 1990s and 2000s 
(7-9) suggested a cultural origin, relating 
gender gaps in math to measures of coun- 
tries’ gender inequality. However, with more 
recent studies (J0—12) having shown that this 
relation is weak, today we have no clearly 
identified relationship between countries’ 
socioeconomic or cultural environment and 
the gender gap in math. We relate below gen- 
der gaps in math to societal inequalities that 
are not directly related to gender. We find a 
strong and robust relationship and provide 
tests suggesting that it is causal: Countries 
that are generally more egalitarian, or that 
have institutions more conductive to equal- 
ity, have a lower gender performance gap 
in math, suggesting that this gap is partly 
shaped by more general societal inequalities. 
According to the Programme for Interna- 
tional Student Assessment (PISA), there are 
on average only seven girls for ten boys in the 
top decile of the math performance distribu- 
tion among the 35 countries belonging to 
the Organisation for Economic Co-operation 
and Development (OECD). Underrepresenta- 
tion of girls at high levels of performance is a 
common feature of all OECD countries (table 
Sla) and has remained remarkably stable 
since 2000 (table Sib). Gender gaps of the 
same magnitude are also observed in science 
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and reading, the latter in favor of girls (table 
S1). Much former research has been based 
on the gender stratification hypothesis, ac- 
cording to which gender differences in 
opportunities and status shape numerous so- 
cialization processes that in turn may affect 
performance (7). We elaborate on this idea, 
assuming that processes that transform dif- 
ferences in status between social groups into 
differences in performance depend on the 
degree of countries’ inclusiveness; inclusive 
countries are likely to mitigate the impact of 
status differences in general. We hypothesize 
that women have a lower status than men in 
virtually all countries but that this lower sta- 
tus is more likely to be detrimental to girls’ 
performance in countries that are in general 
less fair and inclusive; the more unequal a 
country, the more the status difference be- 
tween boys and girls should translate into 
differences in performance. 


INEQUALITIES AND PERFORMANCE 

We analyze data from five successive waves of 
PISA, an every-three-year international assess- 
ment of the knowledge and skills of 15-year-old 
students in mathematics, reading, and science 
taking place in more than 70 countries (see de- 
tails on all data and analyses in the supplemen- 
tary materials (SM)]. We relate gender gaps in 
PISA performance to measures of countries’ 
societal inequalities that are not directly re- 
lated to gender but reflect (i) general socio- 
economic and cultural inequalities, including 
pure income inequalities, or (ii) educational 
inequalities, including inequalities in perfor- 
mance and in learning opportunities (73). 

We focus primarily on “high performers” 
at levels 5 or 6 from the six PISA proficiency 
levels (10.7% of all students in OECD coun- 
tries in math in 2015). We use as our main 
measure of the gender performance gap in 
math the ratio of girls to boys among these 
high performers. This is because (i) this ra- 
tio is very unfavorable to girls and has not 
narrowed over time and (ii) performance at 
high levels is related to underrepresentation 
of women among STEM college graduates. 

Among the 35 OECD countries observed 
in 2015, this ratio is negatively and signifi- 
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cantly correlated with socioeconomic and 
cultural inequalities such as the income 
Gini index (r = -0.55) (see fig. S1) or the 
variance in the socioeconomic and cultural 
background of a country’s students (r = 
-0.66), with inequalities in school perfor- 
mance such as the share of students from 
a low socioeconomic and cultural back- 
ground among high performers (7 = 0.70) 
(see fig. S1), or with inequalities in learning 
opportunities at school such as the between- 
school variation in students’ socioeconomic 
background (7 = -0.61). 

Consistent with a link between math gen- 
der gap and the way countries perpetuate 
or reduce initial status differences, we find 
a strong correlation between the gender 
gap in math and intergenerational earn- 
ings elasticity (7 = -0.63), education mobil- 
ity measured as parents-child correlations in 
years of schooling (r = -0.54), or the index 
of inequality of economic opportunity (7 = 
-0.64), which measures income inequality 
due to predetermined circumstances beyond 
individual control (such as region of birth or 
parental background). 

A more systematic study of those relation- 
ships using linear regression models shows 
that for a large variety of measures of soci- 
etal inequalities, a higher level of inequal- 
ity is associated with a significantly lower 
girls-to-boys ratio among high performers in 
math (see the table and table S2, column 1). 
About 30% of the cross-country variance in 
this ratio can be accounted for by a single 
measure of inequality, and the explained 
variance reaches 60% when three variables 
reflecting different aspects of inequalities 
are jointly included. 

Several robustness checks show that the 
relation is still observed when (i) controlling 
for countries’ gross domestic product (GDP) 
and extent of gender stratification (table S3); 
(ii) including 37 partner non-OECD countries 
(table S4); (ii) considering the gender gap in 
average performance, although the relation 
is weaker (table S5); (iv) using different mea- 
sures of the gender gap in math among high 
performers (table S6); (v) using former PISA 
surveys or other data sources (see, e.g., table 
S7); and (vi) using lagged measures of in- 
come inequalities [in the 1980s and to some 
extent in the 1950s and 1960s (see SM)]. We 
also show that inequalities are associated 
with lower school performance of both boys 
and girls, with the estimated association sys- 
tematically larger for girls (table S8). 

We conclude that the relationship be- 
tween the math gender performance gap 
and societal inequality is larger and more 
robust than other relationships already doc- 
umented with more obvious country char- 
acteristics such as gender stratification or 
economic development. 
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To back up this point, we run “horse races” 
and also apply three standard machine-learn- 
ing techniques for model selection to com- 
pare the explanatory power of measures of 
gender inequality and of countries’ develop- 
ment to our measures of non-gender-related 
inequalities (see SM). For instance, the effect 
of the Gender Gap Index (GGI) or of the Fe- 
male Labor Force Participation on the gender 
gap in math becomes statistically nonsignifi- 
cant as soon as a measure of (non-gender-re- 
lated) societal inequalities is introduced as a 
competing explanatory variable in regression 
models (table S9). 

Societal inequalities are also associated 
with lower girls-to-boys ratios in science 


in more unequal countries (see the table, col- 
umn 2, and table S10). This shows, in partic- 
ular, that our main results are not driven by 
differential effects of parents’ education on 
their daughters’ and sons’ math performance 
or by differential allocation of household re- 
sources across children’s gender. 

To explore whether unobserved country 
characteristics are driving the results, we 
control for countries’ time-constant unob- 
served heterogeneity using country fixed- 
effects models. Variations over time in our 
main inequality indicators are almost all 
significantly related to variations over time 
in the girls-to-boys ratio in math, showing 
that countries that reduce relatively more 


Inequalities and gender gaps among high performers in math 


Estimates in column 1 are obtained from separate simple linear country-level regression models on 35 OECD 
countries in PISA 2015. The dependent variable is the ratio of the numbers of girls and boys among high math 
performers. Column 2 provides similar estimates based on individual-level regressions with several control 
variables and adjusted standard errors (see SM). Column 3 is a country fixed-effect version of column 1 based 


on PISA 2003, 2006, 2009, 2012, and 2015. 


COUNTRY-LEVEL INDIVIDUAL-LEVEL COUNTRY FIXED-EFFECT 
EXPLANATORY VARIABLE REGRESSIONS LOGISTIC MODELS MODELS (2003-2015) 
Income inequalities (GINI index) 0.067** -0.094** -0.094** 
Inequalities in socioeconomic and -0.081** -0.169** -0.043* 
cultural background 
Socioeconomic inequalities -0.085** -0.157** -0.027* 
in performance 
Inequalities in learning =0:0/5** -0.154** -0.075 


opportunities 


** P< 0.01, *P < 0.05. See SM for methodological details, the exact definition of inequality variables, sample sizes, standard errors, and R-squares. 


(increasing the gender gap, as in math) and 
higher boys-to-girls ratio in reading (reduc- 
ing the gender gap) (tables $2, S4, S5, and 
$8). Consistent with our hypothesis, inequal- 
ities are detrimental to the performance 
of girls relative to boys in the three topics 
math, reading, and science. 


UNDERSTANDING ORIGINS 

To suggest causality and better understand 
the origin of the relationship between non- 
gender-related inequality measures and 
gender performance gaps, we offer three 
different strategies: (i) individual-level re- 
gressions, (ii) panel analysis, and (iii) instru- 
mental variables. 

We first check that our results are not 
driven by cross-country differences in 
observable students’ characteristics. We 
replicate our analysis using the full PISA stu- 
dent-level data, which contains information 
on students’ grade repetition, parents’ educa- 
tion, and households’ economic and cultural 
resources. Conditional on these controls and 
their interaction with students’ gender, girls 
still have a lower probability relative to boys 
to score above the high-performance cutoff 
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(or increase relatively less) socioeconomic 
inequalities also reduce relatively more the 
gender performance gap in math (see the 
table, column 3, table S11, and fig. $2). 

To assess further a possible causal link, 
we exploit institutional differences between 
countries’ labor markets as instruments for 
their extent of income inequality. We argue 
that institutional features such as bargain- 
ing coverage, union density, or the value 
of the minimum wage are not likely to have 
a direct link with the gender performance 
gap at school, such that any impact they 
may have on the math gap would likely be 
through their impact on social inequalities. 
To obtain estimates less likely to reflect re- 
verse causality or omitted variables biases, 
we retrieve variations in the Gini index that 
are solely explained by those institutional 
variables. These “instrumented” inequalities, 
solely driven by institutional factors, affect 
the gender gap in math to the same extent 
as noninstrumented inequalities (table S12). 

We also study whether institutional fea- 
tures of education systems affect gender per- 
formance gaps by considering measures of 
inequalities in learning opportunities across 
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schools or across students’ socioeconomic 
background, measures of vertical stratifica- 
tion at school, such as the extent of grade 
repetition, and measures of the quality of ed- 
ucation. All those measures, known to affect 
socioeconomic inequalities at school (/4, 15), 
directly relate to gender performance gaps 
across countries (tables $2 and S4 to S7). 

It is striking that general indicators of 
inequalities can explain so well the pat- 
terns of gender differences in math, science, 
and reading performance across countries 
(whereas other indicators directly related to 
gender stratification have limited explana- 
tory power). In more egalitarian countries, 
differences in initial status seem less likely 
to translate into differences in performance, 
and girls are more represented among high 
performers as are, for example, students 
from a low socioeconomic and cultural 
background. This suggests that the gender 
gap in math is a form of social inequality 
like many others. 

This is consistent with our results that gen- 
der performance gaps at school are linked to 
countries’ institutions that more generally 
reduce social and economic inequalities. As 
a consequence, gender equality may not only 
be a matter of gender norms and stereotypes. 
General policies in favor of more inclusive, 
less vertically stratified, and more standard- 
ized education systems may also have a posi- 
tive impact on girls’ performance. 
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Women’s work 


Stepping into scientific roles left vacant during World War I 
brought gains for women in and out of the laboratory 


By Laura Micheletti Puaca 


he ability of war to disrupt gender roles 
is well established. Amid the chaos of 
fighting, the enlistment of men, and 
the need to keep home fires burning, 
women have historically stepped into 
roles previously considered 
masculine domains. In World War 
J-era Britain, women could be 
found in a variety of new places, 
such as the munitions factories 
and military auxiliaries that have 
attracted the most attention by 
scholars to date. Less well known 
is women’s work in scientific 


argues, “in wartime, the need to find out 
why bread goes mouldy was as pressing as 
the invention of a new explosive.” 

Although some women received recogni- 
tion for their work, many did not, a situation 
that raises much larger questions about the 
nature of women’s wartime gains. On the one 
hand, many enjoyed increased ed- 
ucational and professional oppor- 
tunities that had not been made 
available to them before the war. 
Yet those opportunities were also 
circumscribed by gender stereo- 
types and traditional conventions 
regarding women’s work. 

Throughout the war, female 


fields. With A Lab of One’s Own, Abab of One'sOwn — scientists were overwhelmingly 
Patricia Fara aims to correct this Poietanal relegated to the lowest-paid and 
imbalance, masterfully bringing Et Weeldiec least-prestigious positions. Much 
to light women’s wartime contri- Patricia Fara of their work, moreover, involved 
butions to these areas, as well as Oxford University tasks that were not only tedious 


their efforts to improve women’s 
status in both science and society. 

During World War I, countless British 
women designed airplane equipment, in- 
vented drugs, tested bombs, studied the ef- 
fects of tear gas, and cracked codes. They 
also ran hospitals and science museums and 
taught at universities. Even seemingly “fem- 
inine” fields, such as nutrition, assumed in- 
creased importance during the war. As Fara 
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but also dangerous, such as in the 
case of industrial chemists, where 
women made up almost 90% of their ranks. 
Although outright hostility to female sci- 
entists seemed to diminish during the war, 
women continued to be marked as outsiders. 
Their appearance was routinely scrutinized, 
and some scientific women also felt pressure 
to reassure anxious observers of their femi- 
ninity, as if to offset the masculine nature 
of their work. The assumption that women 
in science were temporary participants was 
made all the more apparent when many 
male veterans returned to reclaim their jobs. 
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Renowned as a paleontologist, Marie Stopes also 
championed scientifically sound sex education. 


As Fara demonstrates, however, women’s 
wartime participation was not all for naught. 
Some were in fact successful in applying their 
wartime training to postwar careers. Their 
wartime experiences, coupled with a postwar 
influx of government money into science, ed- 
ucation, and industry, allowed them to enjoy 
expanded opportunities once again. “They 
were still not getting their fair share,” Fara 
concedes, “but at least the cake was larger.” 

Others, regardless of whether they re- 
mained in scientific fields, found that their 
self-confidence and expectations had been 
irrevocably heightened by their participa- 
tion in wartime science. Compared with the 
short-term nature of some gains, this trans- 
formation was long-lasting. 

Some of the most fascinating parts of 
the book include analyses of women who 
blended their work and their commitment 
to women’s rights. During the war, for ex- 
ample, Louisa Garrett Anderson and Flora 
Murray—who were medical pioneers, suf- 
fragettes, and long-term companions—set up 
a number of hospitals that were staffed en- 
tirely by women. Their most famous venture, 
the Endell Street Hospital in the Bloomsbury 
section of London, was an explicitly politi- 
cal organization with clear ideological and 
financial ties to the suffrage movement. 

Meanwhile, other scientifically trained 
women gravitated toward the National Union 
of Women’s Suffrage Societies (NUWSS), 
which took on important wartime work. For 
much of the conflict, the NUWSS assumed 
the role of a patriotic employment agency, 
which involved, among other things, training 
women to carry out scientific and technical 
jobs usually held by men. Unfortunately, de- 
spite figuring prominently in the title, suf- 
frage does not get as much attention as one 
might expect in A Lab of One’s Own. More of 
these stories would have enriched the book. 

As Fara is careful to note, the degree to 
which women’s wartime contributions influ- 
enced the 1918 decision to grant suffrage to 
women over 30 (subject to other restrictions) 
is the subject of much debate. What can be 
said with more certainty is that “public opin- 
ion was nudged towards approving female 
suffrage” in recognition of women’s work. 

Perhaps a more important issue, though, 
is the role that suffrage played in improving 
women’s status. Many professional women 
viewed the vote as the first step in a much 
longer battle. For them, being paid, pro- 
moted, and respected in their fields was just 
as, if not more, important. Indeed, these con- 
cerns remain pressing ones today. 
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Life in triplicate 
A tale of triplets separated at birth raises red flags 
and questions about the role of nature and nurture 


By Gabrielle Kardon 


ow much are we shaped by our ge- 
netic makeup, and how much does 
the environment in which we are 
raised contribute to our physical ap- 
pearance, our personalities, and our 
preferences? The relative im- 
portance of nature versus nurture is 
a fundamental question in biology 


Three Identical 


started a business together—a restaurant 
aptly named Triplets. 

But why were the triplets separated at 
birth? Their adoption, we learn, was arranged 
by the preeminent (now defunct) New York- 
based Louise Wise Adoption Agency, founded 
in the early 20th century to match Jewish or- 
phans with adoptive families. 

In the 1960s, some psychologists 
believed that separating orphaned 


and is notably difficult to study in Strangers twins or triplets afforded a bet- 
humans, due to ethical concerns. Tim Wardle, ter opportunity for each to forge 

Analyzing identical twins is one director an independent identity. The trip- 
way to examine this question, but Lean lets, however, were deliberately as- 


most twins are raised in a single 
household, where the effects of nature and 
nurture are inextricably intertwined. The 
story of not two, but three, identical broth- 
ers separated at birth is the subject of an 
insightful and disturbing new documen- 
tary, Three Identical Strangers, that pre- 
miered this January at the 2018 Sundance 
Film Festival. 

In 1980, Robert Shafran arrived at Sul- 
livan County Community College in up- 
state New York as a new freshman and was 
greeted by unexpected backslaps from fel- 
low students welcoming him back to school. 
He realized he was being mistaken for a 
former student, with whom he quickly ar- 
ranged a meeting. 

Indeed, he and Eddy Galland looked ex- 
actly the same, but what’s more, they shared 
a birthday and were both adopted, leading 
the two to conclude that they must be twins. 
Newspaper headlines and photos of the pair 
attracted the attention of yet another curly- 
haired 19-year-old, David Kellman, and the 
three discovered that they were triplets sep- 
arated at birth. 

A media frenzy ensued—complete with 
television interviews with Tom Brokaw 
and Phil Donahue—and a giddy period of 
discovery began. Not only were the trio 
identical physically, they shared many 
specific behaviors: All were wrestlers, had 
a preference for dating older women, and 
(apropos of the time) smoked the same 
brand of cigarettes. The long-lost brothers 
bonded, joyfully exploring the 1980s New 
York City party scene together, and even 
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signed to particular families as part 

of a secret study designed to examine the 
effects of nature versus nurture. 

The study, led by child psychoanalyst 

Peter Neubauer, placed multiple sets of 

twins and the triplets with families whose 


socioeconomic status, and even family 
structure, were carefully controlled. The 
families were aware of neither the other 
identical siblings nor their participation in 
the study. 

Yearly comprehensive questionnaires 
and film documentation of the siblings’ 
milestones were couched as standard fol- 
low-up studies of adopted children. As the 
film unfolds, even more sinister details of 
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Eddy Galland (left), David Kellman, and Robert Shafran (right) were placed in separate households as infants. 


the study and its effects on the triplets’ 
lives are revealed. 

Five years in the making, Three Identi- 
cal Strangers deftly interweaves interviews 
with the triplets, their family members, 
and the few surviving researchers associ- 
ated with the research. The film tackles 
both the questionable underpinnings of the 
study and the fundamental questions it at- 
tempted to address. 

Current rules of informed consent would 
now preclude the existence of such a study, 
and Wardle implies that the moral and ethi- 
cal issues of the study were likely of con- 
cern to Neubauer, who died in 2008, as well 
as to the Louise Wise Agency. The irony 
of a Jewish researcher and a Jewish adop- 
tion agency conducting a twin study after 
the atrocities waged against Jewish people 
in Nazi Germany is clear and perhaps the 
reason that Neubauer never published the 
study and sealed his data in a Yale archive 
until 2066. 

Although the study was clearly unpar- 
donable, its questions are enduring. How 
much are we the product of genetic inheri- 
tance? How does the environment shape 
who we become? To what extent do we have 
free will? 


% te 


The striking divergence of Shafran, Gal- 
land, and Kellman in physical appearance 
after 1980 confirms that genetics control 
just part of our makeup, but many questions 
remain unanswered. Winner of a special 
Sundance jury award for storytelling, Three 
Identical Strangers will stimulate viewers to 
think about these important issues. = 
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Edited by Jennifer Sills 


Arrival routes of first 
Americans uncertain 


In their Perspective “Finding the first 
Americans” (3 November 2017, p. 592), T. J. 
Braje et al. argue that people first entered 
the Americas about 25,000 to 15,000 years 
ago by way of the Pacific coast. We believe 
that current evidence yields far less cer- 
tainty than Braje et al. suggest—and more 
likely supports a later arrival by inland 
and/or coastal routes. 

Contrary to Braje et al., genetic evidence 
indicates that Native American ances- 
tors diverged from Siberian populations 
~24,900 to ~18,400 years ago, followed by 
population expansion into the Americas 
~16,000 to ~13,500 years ago (7). Only two 
Native American lineages have been identi- 
fied south of Beringia: a northern lineage 
constrained to northern North America, 
and a southern lineage directly linked with 
Clovis—the earliest unequivocal wide- 
spread cultural manifestation south of the 
ice sheets (2). Thus, a Native American 
lineage in the Americas between 25,000 
and 15,000 years ago is inconsistent with 
current data. There is no consensus on 
the validity of purported pre-16,000-year- 
old sites, which vary in accurate dating, 
unambiguous artifacts, and clear associa- 
tion between them (3). Moreover, there 
are few technological connections among 
16,000- to 13,500-year-old pre-Clovis sites, 
or with later Paleoindian artifacts; thus, 
the relationships between these sites and 
later Native Americans remain ambiguous. 

The coastal colonization route Braje et 
al. advocate remains a viable hypothesis 
for a later arrival date, but several issues 
should be addressed (4). Despite the rise 
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of sea levels in the Holocene, much of the 
late Pleistocene coast from Puget Sound 

to Alaska remains above sea level (5), yet 
surveys have failed to discover coastal sites 
securely dated older than 12,500 years ago 
(4). Furthermore, widespread empirical 
patterns are inconsistent with the coastal 
hypothesis: All known populations in 
Siberia, Russian Far East, and Beringia had 
terrestrial-oriented economies and tech- 
nologies, as did widespread Paleoindian 
groups (6, 7), with limited evidence of 
coastal exploitation in lower latitudes (8). 
This empirical patterning suggests that 
they were more likely to follow a land 
route through Siberia, Beringia, and the 
Americas south of the ice sheets. Current 
studies indicate that deglaciation began 
19,000 years ago and that an ice-free corri- 
dor, largely vegetated and free of proglacial 
lakes, existed by at least 15,000 to 14,000 
years ago (9, 10). 

Braje et al. suggest that stemmed 
projectile points in different contexts 
provide evidence for a coastal expansion 
before 16,000 years ago, but this is not a 
consensus view. Stemming is a widespread 
form of weapon design that was innovated 
numerous times and thus cannot be used 
to argue for cultural affiliation. No tech- 
nological analysis has established a valid 
connection between these disparate assem- 
blages, and there remains debate on the 
dating of North American stemmed points 
(11) because most securely dated sites are 
younger than Clovis. 

Braje et al. assert that there is near 
complete agreement among archaeologists 
on these issues. However, the most recent 
survey (72) showed that archaeologists 
are divided, with many thinking that both 
interior and coastal routes were used, 
and expressing skepticism about several 
proposed pre-Clovis sites. Genetic and 
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In this watercolor, Clovis Paleoindians journey 
eastward toward the Americas. 


archaeological data suggest expansion 
from Siberia into the Americas around 
16,000 to 13,500 years ago, consistent 
with terrestrial and/or coastal migrations. 
This evidence base explains the absence 
of consensus among scientists regarding 
both routes and timing of the peopling of 
the Americas. 
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Response 


In our Perspective, we argue that the earli- 
est Americans followed a coastal migration 
route. Potter et al. counter that an inland, 
ice-free corridor route was more likely. The 
genetic and archaeological evidence that 
Potter et al. discuss support both the tim- 
ing and coastal route for the peopling of 
the Americas presented in our Perspective. 
Potter et al. are disappointed, however, that 
we did not also emphasize the possibility of 
an interior ice-free corridor migration and 
more tightly constrain colonization to after 
16,000 years ago. This misses the main 
points of our Perspective: A Pleistocene 
coastal migration into the Americas 

has been bolstered tremendously by the 
discovery of early coastal and near coastal 
sites in the Americas, and more research, 
particularly underwater archaeology, is 
badly needed. 

Potter et al. imply that in the most 
recent survey “archaeologists were divided, 
with many thinking both interior and 
coastal routes were used.” In fact, 86% of 
respondents selected “coastal migration,” 
compared with 65% for “interior passage 
migration” (1). This represents a sea change 
from 30 years ago when funding dollars 
and research efforts were funneled toward 
ice-free corridor models of New World 
colonization, resulting in decades of mar- 
ginalization of research into early coastal 
migrations and adaptations (2). 

Potter et al. underestimate the complex- 
ity of finding intact late Pleistocene coastal 
sites. Even in geological contexts where 
Pleistocene shorelines remain aboveground 
(such as in Alaska and British Columbia), 
coastal erosion and other forces tend to rav- 
age or obscure early sites. In contrast to the 
sustained and intensive efforts to identify 
early sites in interior regions, the search for 
Pleistocene sites in coastal settings is just 
getting started. A growing number of such 
sites have been identified in places where 
systematic efforts to find late Pleistocene 
to early Holocene coastal sites have been 
made: the Pacific Northwest, the islands of 


SCIENCE sciencemag.org 


Alta and Baja California, and coastal Peru 
and Chile (3-6). 

Recent genomic evidence suggesting a 
migration into the Americas from Northeast 
Asia between ~16,000 and 13,500 years 
ago does not preclude a coastal migration, 
and ages based on molecular clocks are 
approximations due to unpredictable varia- 
tion in mutation rates. Potter et al. suggest 
that we argue for a migration earlier than 
16,000 years ago based on the distribu- 
tion of stemmed projectile points. We did 
argue that northeast Asian stemmed point 
technologies may support Native American 
origins sometime in the past 20,000 years— 
like genetic data—but that more work is 
needed to link these Asian technologies to 
similar but younger terminal Pleistocene 
toolkits in the Americas. 

Potter et al. subscribe to a window of 
colonization between about 16,000 and 
13,500 years ago, the latter half of which is 
almost certainly incorrect given the widely 
accepted 14,500-year-old occupation at 
Monte Verde (7). A coastal route was open 
by ~17,000 years ago, whereas luminescence 
dates suggest that the ice-free corridor was 
open and viable by 15,000 to 14,000 years 
ago (8), or slightly later (9). The latter route 
would have allowed little time for human 
migrants to reach and establish residency 
at terminal Pleistocene sites in Oregon (10), 
Florida (12), Peru (4), Chile (7), and other 
far-flung regions. 

The timing and viability of the ice-free 
corridor route remains uncertain, and 
arguments that the initial peopling of 
the Americas followed a terrestrial route 
hinge on its reconstruction. Questions 
remain about the coastal migration theory 
as well, and current evidence suggests 
that there likely were multiple dispersals, 
routes, and time frames for the peopling 
of the Americas. 
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Postpublication peer 
review: A crucial tool 


The current peer-review model used through- 
out science is not perfect (7). Whether it be 
the result of poor experimental design, acci- 
dent, or academic misconduct, publication of 
irreproducible, incorrect, or fabricated results 
occurs more frequently than it should [check 
Retraction Watch for recent examples (2)]. 
This leads not only to a waste of precious 
time and financial resources as scientists try 
to replicate or build on flawed research but 
also to damage to the reputation of science 
and to much larger societal impacts (such as 
the loss of public trust in science and loss of 
federal funding). 

An emerging online tool for combat- 
ing these issues is postpublication peer 
review (PPPR). PPPR sites such as F1000, 
ResearchGate, PubPeer, and PubMed 
Commons, as well as Science’s own eLetters, 
provide environments for user comments 
and discussion and are responsible for catch- 
ing flawed research that has slipped through 
traditional peer review (3). In addition to 
identifying fraudulent data, pointing out 
errors, and providing criticism (which gener- 
ally take the form of negative comments), 
PPPR also enables positive feedback (such 
as verifying the reproducibility of results), 
which is valuable but is currently provided 
much less frequently (4). This disparity 
likely stems from the reality that overworked 
scientists do not have time for activities that 
provide little to no recognition (5). However, 
most scientists already participate in infor- 
mal (offline) PPPR. We discuss the results of 
papers with our colleagues, present papers in 
group meetings, and critically analyze papers 
in journal clubs. With a little more effort, a 
formal record of our reviews (negative and 
positive) can be made online for the better- 
ment of science. 

There is a risk to publicly challenging the 
work of established scientists (in particular, 
younger scientists may face retribution), but 
it can be mitigated by providing feedback 
in a respectful, positive, and professional 
manner (4, 6). These are risks that need to 
be taken. The scientific community needs 
to take action to maintain the integrity 
of our published work. With continued 
implementation and development (7), 
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Gregory Isaac Peterson 
Seoul National University, Seoul, Republic of Korea. 
Email: goeterson@snu.ac.kr 


REFERENCES 


1. R.Smith, J. R. Soc. Med. 99, 178 (2006). 

2. Retraction Watch (http://retractionwatch.com). 

. P.Knoepfler, Trends Gent. 31,221 (2015). 

. H. Bastian, PLOS Med. 11, e1001772 (2014). 

. M. Markie, Insights 28,107 (2015). 

. E. Pain, Science Careers (2013); www.sciencemag.org/ 
careers/2013/04/interactive-peer-review-what-s- 
it-reviewers. 

7. J.A Teixeira da Silva, A. Al-Khativ, J. Dobranszki, Sci. Eng. 

Ethics 23,1213 (2017). 


aDorw 


10.1126/science.aas9490 


TECHNICAL COMMENT ABSTRACTS 
Editor's note: The Comment by DeSilva 
and Response by Rosas et al. were 
published online in the 9 March issue. 


Comment on “The growth pattern of 
Neandertals, reconstructed from a juvenile 
skeleton from El Sidrén (Spain)” 

Jeremy M. DeSilva 

Rosas et al. (Reports, 22 September 2017, p. 
1282) calculate El Sidrén J1 to have reached 
only 87.5% of its adult brain size. This finding 
is based on an overestimation of Neandertal 


brain size. Pairwise comparisons with a larger 
sample of Neandertal fossils reveal that it is 
unlikely that the brain of El Sidrén would have 
grown appreciably larger. 

Full text: dx.doi.org/10.1126/science.aar3611 


Response to Comment on “The growth pat- 
tern of Neandertals, reconstructed from a 
juvenile skeleton from El Sidrén (Spain)” 


Antonio Rosas, Luis Rios, Almudena 
Estalrrich, Helen Liversidge, Antonio 
Garcia-Tabernero, Rosa Huguet, Hugo 
Cardoso, Markus Bastir, Carles Lalueza- 
Fox, Marco dela Rasilla, Christopher Dean 
The comment by DeSilva challenges our 
suggestion that brain growth of the El 
Sidrén J1 Neandertal was still incomplete 
at 7.7 years of age. Evidence suggests that 
endocranial volume is likely to represent 
less than 90% adult size at El Sidrén as well 
as Neandertal male plus Krapina samples, 
in line with further evidence from endocra- 
nial surface histology and dural sinus 
groove size. 

Full text: dx.doi.org/10.1126/science.aar3820 


Comment on “Synthesis and characteriza- 
tion of the pentazolate anion cyclo-N,- in 
(N,),(H,0),(NH,),CI” 


Rong-Yi Huang and Heng Xu 

Zhang et al. (Reports, 27 January 2017, 

p. 374) reported synthesis of a cyclo-N,- ion 
putatively stabilized in a solid-state salt 
by hydrogen bonding from surrounding 
counterions. We performed theoretical 
calculations suggesting that HN, would 
be favored over the anion in the reported 
pentazolate salt via proton transfer. 

Full text: dx.doi.org/10.1126/science.aao3672 


Response to Comment on “Synthesis and 
characterization of the pentazolate anion 
cyclo-N,- in (N,),(H,0),(NH,),CI” 

Chao Jiang, Lei Zhang, Chengguo Sun, 
Chong Zhang, Chen Yang, Jun Chen, 
Bingcheng Hu 

Huang and Xu argue that the cyclo-N,- ion 
in (N.),(H,0),(NH,),Cl we described in our 
Report is theoretically unfavorable and is 
instead protonated. Their conclusion is 
invalid, as they use an improper method 

to assess the proton transfer in a solid 
crystal structure. We present an in-depth 
experimental and theoretical analysis of 
(N.),(H,0),(NH,),CI that supports the 
results in the original paper. 

Full text: dx.doi.org/10.1126/science.aas8953 
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Dissecting diphthamide 
biosynthesis 
Dong et al., p. 1247 


CREDITS: (TOP TO BOTTOM) M. DONG ET AL.; JEFF ROTMAN/ALAMY STOCK PHOTO 
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INDUCED SEISMICITY 
Injection depth matters 
for induced earthquakes 


Wastewater injection has induced 
earthquakes in Oklahoma, but 
the relative importance of opera- 
tional and geologic parameters 

in triggering such earthquakes is 
unclear. Hincks et al. developed 


SCIENCE sciencemag.org 


an advanced Bayesian network to 
determine the interplay between 
these parameters in Oklahoma. 
The injection depth above the 
crystalline basement was the 
most important parameter when 
considering the potential for 
release of seismic energy. This 
modeling strategy may provide a 
way to improve forecasts of the 


flict 
. However, 


Fishing may accidentally 
capture endangered species 
such as this green sea turtle. 


impact of proposed regulatory 
changes on induced seismicity. 
—BG 


Science, this issue p.1251 


SURFACE CHEMISTRY 


Watching graphene grow 
The growth of graphene on 
metal surfaces can be catalyzed 
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by mobile surface metal atoms. 
Patera et al. used a high-speed 
scanning tunneling microscope 
to image the growth of graphene 
islands on a nickel surface. High 
temperatures caused carbon 
to diffuse to the surface, where 
mobile nickel atoms catalyzed 
graphene growth on the edges 
of islands. Molecular dynamics 
and density functional theory 
calculations provide mechanistic 
insights into the reaction steps. 
—PDS 

Science, this issue p. 1243 


POLYMERS 
Mimicking the designs 
found in proteins 


Natural proteins combine a 
range of useful features, includ- 
ing chemical diversity, the 
ability to rapidly switch between 
preprogrammed shapes, and 
a hierarchy of structures. 
Panganiban et al. designed 
random copolymers with polar 
and nonpolar groups, using 
many of the features found in 
proteins (see the Perspective by 
Alexander-Katz and Van Lehn). 
Their structures could serve as 
“broad spectrum” surfactants, 
able to promote the solubili- 
zation of proteins in organic 
solvents and help preserve the 
functionality of proteins in aque- 
ous environments. —MSL 
Science, this issue p. 1239; 
see also p. 1216 


COGNITIVE SCIENCE 
The infant as philosopher 


Visual behaviors, such as a shift 
in one's gaze or a prolonged 
stare, can be diagnostic of 
internal thoughts. Cesana-Arlotti 
et al. used these measures to 
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demonstrate that preverbal 
infants can formulate a logical 
structure called a disjunctive 
syllogism (see the Perspective 
by Halberda). That is, if A or B 

is true, and A is false, then B 
must be true. Presenting infants 
with scenes where the outcome 
revealed B to be false evoked 
ooks of surprise. —GJC and 
AMS 


Science, this issue p. 1263; 
see also p.1214 


Call to action 
The developing brain initially 
makes more synapses than it 
needs. With further develop- 
ment, excess synapses are 
pruned away, leaving mature 
circuits. Synapses can be elimi- 
nated by microglia, which engulf 
and destroy them. Vainchtein et 
al. found that the microglia are 
called into action by astrocytes, 
supportive cells on which 
neurons rely. Astrocytes near a 
redundant synapse release the 
cytokine interleukin-33 (IL-33), 
which recruits microglia to the 
site. In mice, disruptions in this 
process, as caused by deficiency 
in IL-33, led to too many excit- 
atory synapses and overactive 
brain circuitry. —PJH 

Science, this issue p. 1269 


Robots reach out 
Unmanned aerial vehicles 
(UAVs) sometimes need to reach 
into confined spaces. Previous 
foldable robotic arms have 

been too flexible to be practical. 
Kim et al. created a modular, 
origami-inspired robotic arm 
with a locking mechanism based 


Flying robot 

with folded (left) S 
and extended an a 
(right) arm 

. a g- V4 
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on the idea that when two folds 
intersect at a right angle, only 
one can be folded at a time. This 
allowed tendon-based actuation 
by asingle electric motor and 
enabled a UAV to grasp objects 
in a narrow space and insert a 
camera among tree branches. 
—RLK 

Sci. Robot. 3, eaar2915 (2018). 


Lysosomes keep neuronal 
stem cells young 


An important consequence of 
aging is loss of regenerative 
capacity in stem cells, par- 
ticularly those of the nervous 
system. Leeman et al. isolated 
quiescent and activated stem 
cells from mice and compared 
their transcriptomes. The 
findings emphasize the role of 
large lysosomes in quiescent 
neuronal stem cells in which 
aggregated proteins accumu- 
late. Treatments that stimulated 
lysosomal function allowed aged 
quiescent stem cells to clear 
protein aggregates and restored 
the cells’ ability to be activated. 
Such restoration of stem cell 
function might alleviate com- 
promised proteostasis in aging. 
—LBR 


Science, this issue p. 1277 


Good vibrations for 
movement perception 


Because prostheses do not pro- 
vide physical feedback during 
movement, amputees may not 
feel that they are in full control 
of their bodily action. Marasco 
et al. developed an automated 
neural-machine interface 
that vibrates the muscles 
used for the control of 
prosthetic hands. This 
system instilled kines- 
thetic sense in amputees, 
allowing them to control 
prosthetic hand move- 
ments in the absence 
of visual feedback and 
increasing their sense of 
control. -MM 
Sci. Transl. Med. 10, 
eaa06990 (2018). 
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VOCAL BEHAVIOR 


Speaking in parts 


Edited by Caroline Ash 
and Jesse Smith 


he complexity of human speech is an anomaly 
among primates, where calls have typically 
been thought of as stereotyped and not vari- 
able. Recent work has shown that there is 
more learning and control present in nonhu- 
man primate vocalizations than previously believed, 
but adult calls have nevertheless seemed to be 
stable. Pomberger et al. interrupted calling marmo- 
sets with acoustic playbacks triggered by their own 
calls and found that the animals could extend their 
“phee” calls beyond normal lengths by restarting 
them rapidly after perturbation, but they could 
only alter their calls at periodic time points. These 
results suggest that the phee call is not one long 
stereotyped series, but rather is composed of small 
sequential segments, like human speech. —SNV 


Curr. Biol. 10.1016/j.cub.2018.01.070 (2018). 


Marmoset phee calls 
occur in units, 
like human speech. 


Predicting fecal 
transplant success 


Consumption of fecal matter 

is taboo, unless you have a 
life-threatening disease, such 
as Clostridium difficile infection. 
In these cases, fecal microbiota 


transplantation (FMT) is extraor- 


dinarily effective at restoring 
gut function to a healthier state. 
However, FMT does not work so 
well for other indications that 
are less strongly associated with 
the microbiota. We have a weak 
understanding of what factors 
allow microbial engraftment 

in the gut. Smillie et al. took 
genomic data from FMT-treated 
C. difficile patients to build a 
machine-learning statistical 


Published by AAAS 


model that tells us which bacte- 
rial strains will engraft. The main 
predictive factors are abundance 
of a strain in the donor and the 
species present in the patient. 
Donor strains whose species 
are also present in the patient 
always engraft. If the taxa are 
discordant, then it appears that 
microbial interactions prevent 
engraftment. —CA 
Cell Host Microbe 10.1016/ 
j.chom.2018.01.003 (2018). 


Secrets of CRISPR 


enzymes revealed 
CRISPR-Cas9 cleaves DNA, 
enabling genome editing. 
Surprisingly, three studies have 


sciencemag.org SCIENCE 
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CREDIT: A. SAPALA ET AL., ELIFE 10.7554/ELIFE.32794 (2018) 


concurrently uncovered an 
intrinsic RNA cutting activity of 
Cas9. Cas9 cuts endogenous 
RNA in some bacteria and, 
when programed, targets RNA 


in test tubes as well. Strutt et al. 


showed that Cas9 expression 
in a heterologous bacterium 
protects against infection by 
an RNA virus. Rousseau et al. 
showed that the activity can be 
switched off by virus-derived 
anti-Cas9 proteins. Dugar et al. 
profiled the cellular RNAs that 
are subjected to endogenous 
Cas9 targeting. Although how 
itis used to fend off intruders 
or regulate gene expression by 
bacteria remains to be inves- 
tigated, this newly identified 
activity of Cas9 provides an 
exciting tool in our increasing 


SCIENCE sciencemag.org 


arsenal for modulating RNA 
metabolism. —SYM 


eLife 10.7554/eLife.32724 (2018); 
Mol. Cell. 10.1016/j.molcel.2018.01.025, 
10.1016/j.molcel.2018.01.032 (2018). 


Tracking expanding 
dimensions of words 


Language can evolve by adding 
new meanings or “senses” to old 
words. Thus, the word “face” can 
refer to the body part or the front 
surface of a cliff. There are several 
theories regarding how new 
senses of words arise historically. 
Ramiro et al. used an algorithmic 
approach to study 5000 English 
words taken from a digital histori- 
cal dictionary that covers word 


usage from Anglo-Saxon times 
to the present. They found that 
one model—nearest-neighbor 
chaining—predominates, possibly 
because it minimizes the cogni- 
tive costs of communicating new 
ideas. —BJ 

Proc. Natl. Acad. Sci. U.S.A. 10.1073/ 

pnas.1714730115 (2018). 


Tumor sensing by 


the immune system 


The idea that innate lymphoid 
cells (ILCs) play a role in cancer is 
of interest because these immune 
cells respond to IL-33, a cytokine 
proposed to regulate cancer 
development. Saranchova et al. 
report that the subgroup called 
ILC2s may limit tumor growth 
rate and metastasis. In a mouse 
model engineered to lack ILC2s, 
lung cancer cells implanted into 
the leg enlarged more rapidly 
than in mice expressing ILC2s. 
Moreover, in the absence of ILC2, 
cancer cells spread to other 
tissues, including the lung, brain, 
and adrenal gland. The authors 
propose that IL-33 released by 
the cancer cells locally stimulates 
ILC2s, which in turn produce 
other cytokines that promote an 
immune response to cancer, limit- 
ing its growth and spread. —LC 
Sci. Rep. 10.1038/ 
s41598-018-20608-6 (2018). 


Plant puzzle patterns 
The epidermal cells of plants 
have a jigsaw-like appearance 


Jigsaw patterning in pl 
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when viewed microscopically. 
What is the function of this intri- 
cate pattern, and how does it 
arise? Sapala et al. suggest that 
the interlocking arrangement 
reduces mechanical stress in 
the cell wall—stress that arises 
from the turgor pressure from 
within. Simulation modeling 
and observations of several 
plant species and organs show 
that as growth proceeds, stress 
causes localized cellulose 
deposition, leading to small 
surface indentations in cell walls 
that gradually develop into the 
jigsaw pattern. —AMS 

eLife 10.7554/eLife.32794 (2018). 


Seeing the human hand 
Environmental temperature 
changes occur both naturally 
and because of human activity 
(anthropogenic global warming), 
and distinguishing those two 
types is difficult. Wills et al. 
present a method that can be 
used to make that distinction for 
Pacific sea surface temperatures, 
which allows them to identify 
the separate effects of global 
warming, the Pacific Decadal 
Oscillation (PDO), and the 
EI Nifio—Southern Oscillation. 
They do this on the basis of dif- 
ferences in time scale and spatial 
pattern, which enables them to 
circumvent the use of climate 
models. Among other results, 
they find that midlatitude PDO 
anomalies are more persistent 
than previously thought. —HJS 
Geophys. Res. Lett. 10.1002/ 
2017GLO076327 (2018). 
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TOPOLOGICAL PHOTONICS 
Topological protection 
for lasers 


Ideas based on topology, 
initially developed in mathemat- 
ics to describe the properties 
of geometric space under 
deformations, are now finding 
application in materials, elec- 
tronics, and optics. The main 
driver is topological protec- 
tion, a property that provides 
stability to a system even in the 
presence of defects. Harari et 
al. outline a theoretical proposal 
that carries such ideas over to 
geometrically designed laser 
cavities. The lasing mode is 
confined to the topological edge 
state of the cavity structure. 
Bandres et al. implemented 
those ideas to fabricate a 
topological insulator laser with 
an array of ring resonators. The 
results demonstrate a power- 
ful platform for developing new 
laser systems. —ISO 

Science, this issue p. 1230, p. 1231 


CIRCADIAN RHYTHMS 
Daily transcription 
cycling in the baboon 


Much of our knowledge about 
the important effects of cir- 
cadian rhythms in physiology 
comes from studies of mice, 
which are nocturnal. Mure 
et al. report transcriptional 
profiles from many tissues 
and brain regions in baboons 
over a 24-hour period (see the 
Perspective by Millius and Ueda). 
The results emphasize how 
extensive rhythmic expression is, 
with more than 80% of protein- 
coding genes involved. They 
also highlight unanticipated 
differences between the mouse 
and baboon in the cycling of 
transcripts in various tissues. 
The findings provide a compre- 
hensive analysis of circadian 
variation in gene expression for a 
diurnal animal closely related to 
humans. —LBR 

Science, this issue p. 1232; 

see also p.1210 
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HUMAN GENOMICS 
Hidden effects of 
Mendelian inheritance 


Identifying the determinate fac- 
tors of genetic disease has been 
quite successful for Mendelian 
inheritance of large-effect patho- 
genic variants. In these cases, 
two non- or low-functioning 
genes contribute to disease. 
However, Mendelian effects 
of lesser strength have gener- 
ally been ignored when looking 
at genomic consequences in 
human health. Bastarache et 
al. used electronic records to 
dentify the phenotypic effects 
of previously unidentified 
Mendelian variations. Their 
analysis suggests that individu- 
als with undiagnosed Mendelian 
diseases may be more preva- 
lent in the general population 
than assumed. Because of this, 
genetic analysis may be able to 
assist clinicians in arriving at a 
diagnosis. —LMZ 

Science, this issue p. 1233 


ENZYMOLOGY 
A quick freeze shows 
an enzyme’s secrets 


Organic radicals are chemi- 
cally useful in enzymatic 
reactions but are often hard to 
observe, owing to their short 
lifetimes. Dong et al. used 

rapid freeze-quench methods 
to trap two intermediates 
formed by a noncanonical 
radical S-adenosylmethionine 
(SAM) enzyme: a fragmented 
SAM molecule bound to the 
iron-sulfur cluster through an 
iron-carbon bond and a prod- 
uct-like radical. The structure of 
the SAM-bound enzyme reveals 
a noncolinear arrangement of 
carbon, sulfur, and iron atoms. 
The arrangement of bonds sug- 
gests that the organometallic 
intermediate may be created 
through a two-electron nucleo- 
philic mechanism. A subsequent 
radical intermediate is formed 
on the protein substrate and 
resolves by oxidation to form 
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the amino acid product diph- 
thamide. —MAF 
Science, this issue p. 1247 


NITROGEN CYCLE 
Stream physics 
set the limits 


A combination of physical trans- 
port processes and biologically 
mediated reactions in streams 
and their sediments removes 
dissolved inorganic nitrogen 
(DIN) from the water. Although 
stream chemistry and biology 
have been considered the domi- 
nant controls on how quickly DIN 
is removed, Grant et al. show 
that physics is what sets the lim- 
its on removal rates of nitrate (a 
component of DIN). Residence 
time in the hyporheic zone 
(the region below the sediment 
surface where groundwater and 
surface water mix) determines 
the maximum rate at which 
nitrate can be removed from 
stream water. Nevertheless, at 
local scales, chemistry and biol- 
ogy modify how closely to that 
maximum rate removal occurs. 
—HJS 

Science, this issue p. 1266 


MALARIA 
Sexual development 
in Plasmodium 


Malaria-causing parasites 
(Plasmodium) have complex life 
histories in the tissues of humans. 
For the most part, the parasites 
focus their efforts on replica- 

tion within the human host cells. 
However, occasionally, some 
replicating cells release gametes 
into the bloodstream, which are 
picked up by biting mosquitoes. 
Filarsky et al. discovered that the 
Plasmodium parasite keeps the 
production of gametes under 
tight epigenetic control using 
heterochromatin protein 1 (HP1). 
Plasmodium gametocytogenesis 
is initiated when HP1 is evicted 
from upstream of gamete-specific 
genes by gametocyte develop- 
ment 1 (GDV1) protein. GDV1 is 
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in turn regulated by its antisense 
RNA. What triggers GDV1 expres- 
sion remains unclear. Elucidating 
this pathway could provide a 
target for interrupting malaria 
transmission. —CA 

Science, this issue p. 1259 


CIRCADIAN RHYTHMS 
Chromosome dynamics 
and cellular clocks 


Many genes undergo daily or 
circadian changes in their rate of 
transcription. Kim et al. explored 
the mechanism by which the 
circadian clock is linked to 
chromosome dynamics (see the 
Perspective by Diettrich Mallet 
de Lima and Gondor). They 
used a chromosome conforma- 
tion capture technique (Hi-C) 
to identify the interactions of 
adjacent DNA fragments and 
determine how DNA looping 
that altered such interactions 
changed over daily cycles. The 
repressive transcription factor 
Rev-erba, which functions as 
part of the mammalian clock 
mechanism, appears to bind to 
chromatin and recruit a protein 
complex that evicts other pro- 
teins that enhance looping, thus 
favoring enhancer-promoter 
interactions. —LBR 

Science, this issue p. 1274; 

see also p.1212 


SINGLE-CELL ANALYSIS 
Mutation rates and 
effects in single cells 


Understanding the dynamics of 
mutations and the distribution of 
fitness effects is critical for most 
evolutionary models. Robert et al. 
used a single-cell technology to 
visualize the accumulation of new 
mutations. The method identifies 
DNA sequences with mispaired 
bases and small insertions or 
deletions caused by DNA replica- 
tion errors in living Escherichia 
coli cells. Following the fates 

of cells after mutation allowed 
for a precise quantification of 

the effects of new mutations. 


sciencemag.org SCIENCE 


Asmaller fraction of mutations 
were found to be deleterious than 
predicted previously from indirect 
observations. —LMZ 

Science, this issue p. 1283 


CANCER THERAPY 
Anew target induces 
synthetic lethality 


Identifying mutations that can 
be targeted to cause tumor cell 
death selectively in a process 
known as synthetic lethality has 
become a paradigm of cancer 
research. Loss of DNA repair 
can provide a selective growth 
advantage to tumor cells by 
enhancing mutation rates and 
driving tumor evolution. Ina 
Perspective, Higgins and Boulton 
discuss how the DNA damage 
response protein known as DNA 
polymerase 6 is increased in 
certain tumors, such as those 
found in breast and ovarian can- 
cers. This pathway represents an 
exciting anticancer target that 
could perhaps be combined with 
radiotherapy to improve thera- 
peutic response. —GKA 

Science, this issue p. 1217 


ECOLOGY 
How animals navigate 
infection risk 


Ecologists have long recognized 
how fear of predators affects ani- 
mal behavior. In a Perspective, 
Weinstein et al. highlight the 
importance of another factor: 
infection risk. Using examples 
from a range of animals, they 
show how the disgust triggered 
by potential sources of infec- 
tion affects animal behavior. 
Understanding how animals 
respond to the overlapping 
andscapes of predation and 
infection risk is crucial for pre- 
dicting how ecosystems respond 
to changing environments. —JFU 
Science, this issue p.1213 


IMMUNOLOGY 
A psoriasis target 
in Langerhans cells 


Psoriasis is an autoimmune 
skin condition that is linked to 


SCIENCE sciencemag.org 


the proinflammatory cytokine 
interleukin-17 (IL-17), which is 
produced by T cells in the skin. 
Zheng et al. found that p38a sig- 
naling in skin-resident dendritic 
cells (Langerhans cells) was 
important for the pathogenesis 
of psoriasis in a mouse model 

of the disease. p38a signaling 

in Langerhans cells stimulated 
the production of IL-23, which is 
critical for the development of 
L-17—producing T cells. Genetic 
deletion or pharmacological 
inhibition of p38a reduced skin 
inflammation in mice with estab- 
ished psoriatic disease. —ERW 


Sci. Signal. 11,eaao1685 (2018). 


INFECTIOUS DISEASE 
Structural insights 
into RSV 


Respiratory syncytial virus (RSV) 
causes respiratory infections 
associated with severe morbidity 
and mortality in infants, young 
children, and the elderly. No 
licensed vaccine against this 
disease exists. Fedechkin et al. 
describe cocrystal structures 
of the RSV G glycoprotein 
conserved central domain 
(CCD) bound by two different 
broadly neutralizing monoclonal 
antibodies (mAbs). Both mAbs 
bind to conformational epitopes 
on this highly conserved region. 
The RSV G CCD can activate the 
chemokine receptor CX3CRI1, 
and this activity can be blocked 
by binding of the mAbs. These 
findings provide insight into how 
neutralizing mAbs interact with 
the RSV G glycoprotein and may 
advance the development of 
RSV vaccines and therapeutics. 
—CNF 

Sci. /mmunol.3, eaar3534 (2018). 


NEUROSCIENCE 
MHCI suppresses relapse 


Relapse is a persistent prob- 
lem in the treatment of drug 
addiction and carries a great 
personal and societal cost. 
Murakami et al. discovered a 
molecular mechanism that 
suppresses relapse to cocaine 
addiction in mice. In a transgenic 
mouse model, cocaine self- 
administration reduced major 


histocompatibility complex 
class | (MHCI) levels in the 
brain's reward system, enhanced 
glutamate signaling, and led 
to behavioral relapse. Up- or 
down-regulating genes that 
influence the function of MHCI in 
dopamine neurons prevented or 
promoted this addictive profile. 
—KSL 

Sci. Adv. 10.1126/sciadv.aap7388 

(2018). 
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TOPOLOGICAL PHOTONICS 


Topological insulator laser: 


Theory 


Gal Harari,* Miguel A. Bandres,* Yaakov Lumer, Mikael C. Rechtsman, 
Y. D. Chong, Mercedeh Khajavikhan, Demetrios N. Christodoulides, Mordechai Segev{t 


INTRODUCTION: Topological insulators 
emerged in condensed matter physics and 
constitute a new phase of matter, with in- 
sulating bulk and robust edge conductance 
that is immune to imperfections and dis- 
order. To date, topological protection is 
known to be a ubiquitous phenomenon, oc- 
curring in many physical settings, ranging 
from photonics and cold atoms to acoustic, 
mechanical, and elastic systems. So far, how- 
ever, most of these studies were carried out 
in entirely passive, linear, and conservative 
settings. 


RATIONALE: We propose topological insu- 
lator lasers: lasers whose lasing mode exhib- 
its topologically protected transport without 
magnetic fields. Extending topological phys- 
ics to lasers is far from natural. In fact, lasers 
are built on foundations that are seemingly 
inconsistent with the essence of topological 
insulators: They require gain (and thus are 


Topological lasing mode 


non-Hermitian), they are nonlinear entities 
because the gain must be saturable, and they 
are open systems because they emit light. 
These properties, common to all lasers, cast 
major doubts on the possibility of harness- 
ing topological features to make a topological 
insulator laser. Despite this common mind- 
set, we show that the use of topological prop- 
erties leads to highly efficient lasers, robust 
to defects and disorder, with single-mode 
lasing even at conditions high above the laser 
threshold. 


RESULTS: We demonstrate that topological 
insulator lasers are theoretically possible and 
experimentally feasible. We consider two con- 
figurations involving planar arrays of coupled 
active resonators. The first is based on the 
Haldane model, archetypical for topological 
systems. The second model, geared toward 
experiment, constitutes an aperiodic array 
architecture creating an artificial magnetic 
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Topological insulator laser based on the Haldane model and its efficiency. 

(A) Planar honeycomb lattice of coupled microring resonators pumped at the 
perimeter. The topological lasing mode has unidirectional flux with almost-uniform 
intensity, which builds up as the mode circulates and drops when passing the 
output coupler. (B) Slope efficiency (in arbitrary units) versus disorder strength for 
three cases differing only in the Haldane phase (of the next-to-nearest neighbor 
coupling): a topological laser with the maximum gap (blue; Haldane phase of 2/2), 
one with a small topological gap (red; Haldane phase of x/8), and a topologically 
trivial laser with no gap (black; Haldane phase of 0). 
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field. We show that by introducing satura- 
ble gain and loss, it is possible to make these 
systems lase in a topological edge state. In 
this way, the lasing mode exhibits topolog- 
ically protected transport; the light propa- 
gates unidirectionally along the edges of the 
cavity, immune to scattering and disorder, 

unaffected by the shape 
of the edges. Moreover, 
Read the full article We show that the under- 
at http://dx.doi. lying topological proper- 
org/10.1126/ ties not only make the 
science.aar4003 system robust to fabrica- 
7 tion and operational dis- 
order and defects, they also lead to a highly 
efficient single-mode lasing that remains 
single-mode even at gain values high above 
the laser threshold. 

The figure describes the geometry and 
features of a topological insulator laser 
based on the Haldane model while adding 
saturable gain, loss, and an output port. 
The cavity is a planar honeycomb lattice of 
coupled microring resonators, pumped at 
the perimeter with a lossy interior. We show 
that under these conditions, lasing occurs at 
the topological edge mode, which has uni- 
directional flux and is extended around the 
perimeter with almost-uniform intensity. 
The topological cavities exhibit higher ef- 
ficiency than the trivial cavity, even under 
strong disorder. For the topological laser with 
a small gap, the topological protection holds 
as long as the disorder level is smaller than 
the gap size. 


DISCUSSION: The concept of the topologi- 
cal insulator laser alters current understand- 
ing of the interplay between disorder and 
lasing, and opens exciting possibilities at 
the interface of topological physics and laser 
science, such as topologically protected trans- 
port in systems with gain. We show here that 
the laser system based on the archetypal 
Haldane model exhibits topologically pro- 
tected transport, with features similar to 
those of its passive counterpart. This be- 
havior means that this system is likely to 
have topological invariants, despite the non- 
hermiticity. Technologically, the topological 
insulator laser offers an avenue to make many 
semiconductor lasers operate as one single- 
mode high-power laser. The topological in- 
sulator laser constructed from an aperiodic 
array of resonators was realized experimen- 
tally in an all-dielectric platform, as described 
in the accompanying experimental paper by 
Bandres e¢ al. 
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Topological insulator laser: 


Theory 


Gal Harari,’* Miguel A. Bandres,’* Yaakov Lumer,” Mikael C. Rechtsman,” 
Y. D. Chong,* Mercedeh Khajavikhan,” Demetrios N. Christodoulides,* Mordechai Segev'+ 


Topological insulators are phases of matter characterized by topological edge states that 
propagate in a unidirectional manner that is robust to imperfections and disorder. These 
attributes make topological insulator systems ideal candidates for enabling applications in 
quantum computation and spintronics. We propose a concept that exploits topological 
effects in a unique way: the topological insulator laser. These are lasers whose lasing mode 
exhibits topologically protected transport without magnetic fields. The underlying topological 
properties lead to a highly efficient laser, robust to defects and disorder, with single-mode lasing 
even at very high gain values. The topological insulator laser alters current understanding of 
the interplay between disorder and lasing, and at the same time opens exciting possibilities in 
topological physics, such as topologically protected transport in systems with gain. On the 
technological side, the topological insulator laser provides a route to arrays of semiconductor 
lasers that operate as one single-mode high-power laser coupled efficiently into an output port. 


opological insulators emerged in condensed 

matter physics (J-3) and constitute a new 

phase of matter, with insulating bulk and 

quantized and robust edge conductance. 

The prospect of observing topological ef- 
fects in non-electronic systems was first proposed 
and demonstrated in microwaves in gyro-optic 
crystals with broken time-reversal symmetry (4-6). 
The transition to optical frequencies required 
another conceptual leap. Theoretical proposals 
(7-10) were followed by the first experiments using 
artificial gauge fields—one based on a honeycomb 
lattice of helical waveguides (17) and the other 
based on an array of aperiodic coupled silicon 
microring resonators (12). Topological protection 
is now known to be a ubiquitous phenomenon, 
occurring in physical settings as varied as electro- 


A 


(OO Om Oo Oa Ok OO} 


(OOO OO) 


magnetic waves (6, 1J-16), cold atoms (17, 18), and 
acoustic, mechanical, and elastic systems (19-22). 
So far, however, most of these experiments have 
been carried out in entirely passive, linear, and 
conservative settings. 


Nonhermiticity in topological systems 


Lasers represent complex nonconservative sys- 
tems capable of exhibiting very rich dynamics. 
They are fundamentally non-Hermitian and non- 
linear entities that rely on saturable gain. Intro- 
ducing nonhermiticity to topological systems 
raises fundamental questions concerning whether 
non-Hermitian topological systems can exist at 
all (23), and if they do, how to define the topo- 
logical phases and their stability (24). Part of this 
controversy has recently been resolved, and it is 
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Fig. 1. Geometry and lasing mode in a topological insulator laser 
based on the Haldane model. (A) Cavity geometry (same for topological 
and trivial): a planar honeycomb lattice of coupled microring resonators. The 
cavity has unpumped (lossy) resonators (red), pumped resonators (blue), 
and an output channel (white). (B) The steady-state topological lasing mode 
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now known that one-dimensional non-Hermitian 
systems can have stationary zero-dimensional 
topological edge states (25, 26) and topological 
defect states (27-29), and these can even lase 
(29-32). However, being one-dimensional systems 
lasing from a localized defect, none of these systems 
can support transport via edge states. Very recently, 
lasing from a topological edge state in a photonic 
crystal subject to an external magnetic field—a 
photonic analog of the quantum Hall effect—was 
reported (33). That system used magneto-optic 
effects, which are very weak at optical frequencies, 
and accordingly produced only a narrow topolog- 
ical band gap (40 pm) within which it was claimed 
that lasing occurred (33). Clearly, it will be impor- 
tant to pursue new approaches in expanding the 
topological photonic band gap, and hence the de- 
gree of protection endowed to such structures. 
Equally important, in terms of applications, will be 
to follow an all-dielectric strategy that is by nature 
compatible with semiconductor laser technologies. 

Notwithstanding recent progress in topological 
photonics and lasing therein, the fundamental 
question is still open: Can topological protection 
of transport be incorporated into non-Hermitian, 
highly nonlinear, open systems such as lasers? 
Addressing this issue is at the heart of the topo- 
logical insulator laser, along with many new 
questions: Can a topological insulator include 
gain, which has no equivalent in condensed mat- 
ter physics? What happens to the stimulated 
emission under topologically protected transport? 
Can a topological insulator laser exist without a 
magnetic field, and if so, might it exhibit new 
features and improved laser action? 

Laser cavities are typically optimized to attain 
high Q factors, and they are always strongly af- 
fected by disorder, which can arise from manufac- 
turing imperfections, operational failure, stresses, 
etc. A prominent consequence of disorder is mode 
localization, which has dire implications in pho- 
tonics (34, 35). To a laser, disorder implies de- 
graded overlap of the lasing mode with the gain 
profile, lower output coupling, and multimode 
lasing, which together result in an overall reduced 
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of the topological cavity. The lasing mode is extended all around the 
perimeter of the cavity with almost uniform intensity, and its energy flux is 
unidirectional. The unidirectional energy flux can be detected by the intensity 
buildup as the mode circulates (clockwise) and by the sudden drop in 
intensity when passing (clockwise) the output coupler. 
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efficiency. These problems are further exacerbated 
in more involved arrangements such as laser arrays. 
Such laser array structures tend to lase with many 
modes simultaneously, with their modal structure 
(near-field and spectrum) varying with the pump- 
ing strength. Despite many methods suggested to 
control the emission pattern of laser diode arrays, 
current technology is still not able to make the 
lasers operate as a single high-power coherent 
laser source. Consequently, the most common ap- 
plication of laser arrays is their usage as pumps 
(instead of flashtubes) for solid-state lasers but 
not as coherent high-power laser sources. 


Topological insulator lasers 


Topological insulator lasers are lasers whose lasing 
mode exhibits topologically protected transport, 
such that the light propagates along the edges of 
the cavity in a unidirectional fashion, immune to 
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Fig. 2. Lasing modes in the topological and trivial cavities (Haldane 
model). (A to F) Steady-state lasing modes (colors indicate amplitude) 
for the topological and trivial cavities with no disorder [(A) and (D)], 
disorder [(B) and (E)], and a defect (a missing gain resonator at the 
perimeter) [(C) and (F)]. Being unidirectional and extended, with almost 
uniform intensity, the lasing mode in the topological cavity [(A) to (C)] 
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scattering and disorder, unaffected by the shape 
of the edges. We show that the underlying topo- 
logical properties lead to a highly efficient laser, 
robust to fabrication and to operational disorder 
and defects, that exhibits single-mode lasing even 
at gain values high above the laser threshold. 

We consider two possible configurations in- 
volving planar arrays of coupled, active resona- 
tors. The first is based on the Haldane model 
(36), an archetypical model for time reversal- 
broken topological insulators. The second relies 
on an aperiodic array architecture that creates 
an artificial magnetic field, as was demonstra- 
ted experimentally in passive systems (silicon) 
(8, 12, 13, 37). This is an all-dielectric system that 
has been realized with current semiconductor 
laser technology, as described in (38). 


Design of the topological laser cavity 


In the Haldane design (36), the resonators of 
the topological insulator laser are arranged in 
a honeycomb lattice (Fig. 1A). Each resonator is 
coupled to its nearest neighbors by a real hopping 
parameter ¢,, and to its next-nearest neighbors 
by a complex parameter fy exp(7), where @ is the 
Haldane flux parameter (39). The two sublattices 
of the honeycomb structure have identical on-site 
potentials. The passive Haldane model (no gain 
or loss, but t2 # 0) exhibits two phases: the trivial 
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phase when 9 is equal to 0 or a, and the topo- 
logical phase when 6 = 0, x. In the topological 
phase, edge states emerge with energies extending 
across the topological gap that is proportional 
to t) sin », reaching a maximum at = 1/2. To 
promote the lasing of the topological edge mode, 
we specifically design the honeycomb array to 
have zig-zag edges that have small penetration 
depth into the bulk. The evolution of the field 
of this laser system is governed by 


H T Tr itput 
aldane | / outpu’ 


where yw is a column vector encompassing the 
modal amplitudes of the array elements, and 
Ayaidane is the standard Haldane Hamiltonian 
(39), which depends on the resonance frequency 
of a single resonator wo, the hopping constants 
ty», and the Haldane flux . Here, y represents 
the loss in each resonator; this is assumed to be 
linear loss [as in all constant-wave lasers], al- 
though we simulated saturable loss as well, and 
our results were effectively the same. The third 
term in Eq. 1 represents optical gain via stimu- 
lated emission that is inherently saturable (/,,t). 
Here, P stands for the spatial profile of the 
pump, and Aoutpur describes the output coupler 
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exhausts all the pumped sites and couples strongly to the output 

coupler in all cases. The trivial cavity, lacking unidirectional extended lasing 
modes, couples only weakly to the output coupler even without disorder 
(D). The presence of disorder (E) or defects (F) causes the lasing mode of 
the trivial cavity to be highly localized, further reducing the efficiency 

and the coupling to the output port. 
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Fig. 3. The lasing pro- A 
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contrast, the trivial cav- 
ity has no protected 
transport, as there is no 
gap and the edge 
states are degenerate. 
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gain is increased further above the threshold gain (g;). The topological laser (E) maintains single-mode 
lasing, at least up to gain values of 10 times the threshold gain. By contrast, the trivial laser 
(F) becomes multimode, with three lasing modes, at a gain level just twice the threshold gain. 


(a semi-infinite chain of resonators, which is what 
makes this an open system) and its coupling to 
the cavity. To promote lasing of the edge modes, the 
gain (g) is provided only to the resonators on 
the perimeter (Fig. 1A). In general, g is a function 
of the frequency; however, in semiconductors 
the gain is broadband (relative to the inter-ring 
coupling constant), and hence it is assumed to be 
frequency-independent. 


Topological insulator laser based on the 
Haldane model 


We study the topological insulator laser by sim- 
ulating the full dynamics and by directly solving 
for the nonlinear lasing modes of Eq. 1 (39). We 
consider two cases: (i) a trivial arrangement 
where 6 = 0, and (ii) a topological array when the 
Haldane flux is » = 1/2. In the trivial case (o = 0), 
even without disorder, the first lasing mode is 
localized away from the output coupler (Fig. 2D) 
so as to minimize the power loss through the 
coupler. This strongly affects the lasing efficiency 
because the localized lasing mode does not use 
all the gain available around the perimeter and it 
is coupled only very weakly to the output. This 
adversarial effect is further enhanced in the pres- 
ence of disorder, given that it tends to highly 
localize modes. For example (Fig. 2E), in the pres- 


ence of disorder, the lasing mode of the trivial 
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cavity becomes further confined within just a few 
resonators. Finally, because the trivial lasing mode 
does not spatially deplete the gain, if pumped 
harder, multiple modes can reach threshold, 
giving rise to multimode operation (Fig. 3F). 

In the topological case ( = 1/2), the lasing 
mode possesses all the distinct characteristics of 
topological chiral edge states: It is extended all 
around the perimeter of the cavity with almost- 
uniform intensity, and its energy flux is unidirec- 
tional. The unidirectionality of the energy flux can 
be observed by noticing how the energy starts 
accumulating while moving clockwise around 
the perimeter of the array, and then abruptly 
drops after the output coupler (Fig. 1B). Even 
though the notion of topological invariants in 
non-Hermitian, nonlinear open systems (such 
as lasers) is still largely unexplored, the topo- 
logical nature of this array can be recognized 
by observing its dynamics. In this regard, it is 
instructive to monitor the evolution in the sys- 
tem, from below transparency to lasing. Figure 
3, C and D, shows the behavior of the complex 
frequency of the cavity modes as we increase 
the gain from zero to the threshold level. During 
this process, the frequency (real part) of each 
mode is virtually unchanged, equal to the eigen- 
frequencies of the closed Hermitian system (the 
topological insulator lattice), and the band gap of 
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the topological system (blue region in Fig. 3A) 
remains well defined and constant. Clearly, the 
topological band gap does not close in the pres- 
ence of gain (even high above the threshold level), 
and no topological phase transition occurs. This 
is a clear indication that the system retains its 
topological features even when it lases. More 
important, as shown in Fig. 3C, one of the topo- 
logical edge modes is the first in line to lase. More- 
over, even when we increase the gain (pumping) 
high above the threshold level, the lasing remains 
in asingle mode—the topological edge mode. This 
is because the topological lasing mode has almost 
uniform intensity all around the perimeter of the 
array, saturating the whole gain medium, hence 
suppressing all the other cavity modes and pre- 
venting them from lasing (Fig. 3E). 

To further examine the topological properties of 
the lasing mode, we study the system in the pres- 
ence of disorder and defects. We find that even in 
the presence of disorder, the topological insulator 
laser still operates in a topological edge mode 
(Fig. 2B). Even when a defect is introduced— 
for example, a malfunctioning microring at the 
perimeter—the topological edge mode is able 
to bypass the defect with minimal penetration 
into the bulk (Fig. 2C). 

It is instructive to highlight the role of topol- 
ogy in this topological insulator laser by compar- 
ing its efficiency to its trivial counterpart. The 
efficiency of lasers is universally defined by the 
so-called “slope efficiency”: the slope of the func- 
tion describing the output power as a function of 
gain above the threshold level. The calculated slope 
efficiencies (in steady state) as a function of the 
strength of the disorder are shown in Fig. 4A. 
When the cavity is in the trivial phase, = 0, the 
slope efficiency is poor and further decreases in 
the presence of disorder. In sharp contrast, when 
the laser is topological, » = 1/2, the lasing is very 
efficient and robust; it has a high slope efficiency 
(with a small variance) that remains high even 
at strong disorder levels. The topological protec- 
tion ceases only when the strength of disorder is 
comparable to the size of the topological band 
gap. To reaffirm that the high efficiency of this 
laser is due to its topological protection, we eval- 
uate the slope efficiency for structures with a 
smaller band gap (e.g., for o = 7/8). As seen in Fig. 
4A, the slope efficiency of this “small topological 
gap implementation” is initially high, but when 
the level of disorder is increased, it deteriorates 
much faster than the » = 1/2 case, because the 
disorder closes the smaller topological gap more 
easily. To further highlight the robustness of our 
topological insulator laser, we calculate the mean 
value of the slope efficiency as a function of dis- 
order strength and the two main parameters that 
control the size of the topological band gap: the 
Haldane flux (Fig. 4B) and the next-nearest neigh- 
bor coupling ¢, (Fig. 4C). When the strength of 
randomness is less than the size of the Haldane 
topological band gap (6\/3t, sin ), the slope 
efficiency stays high, and drops only when the 
disorder level exceeds the size of the topological 
band gap (Fig. 4, B and C). This is a clear indi- 
cation that the efficiency and robustness of the 
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topological insulator laser arise from its topo- 
logical properties (40). 


Topological insulator laser based on an 
aperiodic array of resonators 


Next, we study lasing in an aperiodic topolog- 
ical array of microring resonators, which was 
one of the two platforms first explored for re- 
alizing photonic topological insulators in optics 
(8, 12). This arrangement involves a lattice of 
coupled resonators with aperiodic couplers (39)— 
an architecture that can be implemented using 
standard semiconductor technologies (41, 42). 
Note that this system does not use magnetic 
fields, nor does it use any exotic materials (such 
as yttrium iron garnet). It relies only on present 
semiconductor laser technology. The aperiodic 
couplers establish an artificial gauge field, thus 
leading to behavior analogous to the quantum 
Hall effect (72). Because in the linear regime the 
system is reciprocal, both the clockwise (CW) and 
counterclockwise (CCW) modes in each microring 
resonator experience gauge fields with opposite 
signs. This, in turn, makes the overall cavity de- 
generate; that is, for any frequency supported by 
the CW modes, there is a corresponding CCW 
mode. In passive settings (no gain or loss), for 
reasonable experimental parameters, one can 
consider the CW and CCW as decoupled (8, 72). 
For a topological insulator laser based on this 
system, one must take into account that the CW 
and CCW modes inevitably interact with one 
another through the nonlinear effect of gain 
saturation and backscattering, both of which 
naturally occur in active media (43). 

This topological aperiodic laser array is simu- 
lated using an extended version of Eq. 1 (39). To 
account for a realistic setup in current semicon- 
ductor laser technology, we implement saturable 
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gain and explicit coupling between the CW and 
CCW modes through the saturable medium and 
backscattering. We use the parameters of recent 
experiments (42) and of the accompanying ex- 
perimental paper (38), which yield a topological 
band gap of ~1 nm, in an aperiodic ring array 
made of all-dielectric semiconductor laser mate- 
rials. A topological gap of this size ensures very 
high topological protection of transport in the 
laser cavity. We essentially observe the same fea- 
tures as in the Haldane model: The topological 
lasing mode is extended, uniform, and couples 
strongly to the output coupler, even in the pre- 
sence of disorder. In contrast, the topologically 
trivial aperiodic array, having no edge modes, 
suffers from localization of its lasing modes and 
displays strong multimode lasing and low output 
coupling (Fig. 5). It is important to stress that de- 
spite the reciprocity of the system and the inher- 
ent degeneracy of every ring and the presence of 
scattering and saturable gain (which cause lasing 
in both the CW and CCW directions), the topolog- 
ically protected features of the lasing modes pre- 
vail, and the efficiency of the topological array is 
much higher than that of the trivial case. The 
underlying mechanism can be clearly seen from 
the mode shapes in Fig. 5, B to E, where the 
mutual contribution of the two counterpropagat- 
ing (CW and CCW) edge modes with the same 
temporal frequency depletes the gain and enforces 
efficient single-frequency lasing. In a similar vein, 
the topological lasing modes exhibit high robust- 
ness against defects and disorder, despite the 
backscattering and the nonlinear coupling be- 
tween the CW and CCW modes. 


Concluding remarks 


As a result of its topological properties, the topo- 
logical insulator laser exhibits high efficiency, 


extreme robustness to defects and disorder, and 
high-efficiency single-mode lasing even at gain 
levels high above the threshold level. The inter- 
play between topology and nonhermiticity, espe- 
cially in nonlinear open systems such as lasers, 
raises many fundamental questions. We have 
shown that the laser system based on the arche- 
typal Haldane model exhibits topologically pro- 
tected transport, with features similar to its passive 
counterpart. This behavior means that there must 
be associated topological invariants, even though 
this system is non-Hermitian. Topologically pro- 
tected transport in lasing systems can change ex- 
isting paradigms by harnessing the topological 
features of the lasing mode to yield high-efficiency 
single-mode lasing even high above threshold. 
The systems proposed here are a proof of con- 
cept, with no attempt to optimize the integration 
of topological properties into a laser. The topo- 
logical design of the laser cavity can take on 
many different concrete designs, which may lead 
to new ideas and innovative applications. For ex- 
ample, the topological system can be based on 
topological network models of strongly coupled 
resonators (44), which can be realized with very 
small units. More generally, new geometries, 
wherein a medium with narrow-linewidth gain 
is matched to the topological gap, can be con- 
sidered. In a similar vein, topological insulator 
lasers could be designed from flexible gain media 
(organics), and operate under distortions, manip- 
ulations, and perhaps also extreme conditions. It 
may even be possible to integrate topological 
insulator lasers with sensors, antennas, and other 
photonic devices. The accompanying paper (38) 
describes the realization of a topological insu- 
lator laser without magnetic field. It is an all- 
dielectric system, based on the aperiodic array 
of ring resonators described here, fabricated with 


30 


= ie 
a 3 


Slope Efficiency 


° 


5 


0 02 04 O06 08 
2c Disorder Strength 


0 ml4 7/2 
@ Haldane Flux 


Fig. 4. Slope efficiency versus disorder in the topological and trivial 
laser systems (Haldane model). (A) Slope efficiency (in arbitrary units) 
versus disorder strength (measured in terms of its standard deviation) for the 
topological laser with the maximum gap (blue; @ = 2/2) and with a small 
topological gap (red; ® = x/8), and for the trivial laser with no gap (black; o = 0). 
Every point corresponds to the slope efficiency of a different realization of 
the disorder (1000 in total). Solid lines mark the mean values for each case. The 
topological cavities exhibit higher slope efficiency (blue line) than the trivial 
cavity, even under high levels of disorder. For the topological insulator laser with 


Harari et al., Science 359, eaar4003 (2018) 16 March 2018 


. 
0.8 
0.4 
(0) 
i?) 10) 0.15 


B3n/4 rT .05 0.4 

% NNN Coupling 

a small band gap, when the disorder level is increased to above the band gap 
size, the topological protection of this topological laser starts to break and 
its efficiency is markedly deteriorated (red line). (B and C) Mean value of slope 
efficiency (average over 1000 realizations) as a function of disorder and 
Haldane flux » (B) and as a function of disorder and next-nearest neighbor 
coupling ts (C). The dashed black lines depict the size of the band gap of the 
Haldane model at zero disorder, given by 6V3t2 sin o. Clearly, the slope 
efficiency of the topological cavity stays high as long as the disorder strength 
is lower than the size of the topological gap. 
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Fig. 5. Lasing modes in the topological and trivial cavities (coupled- 
resonators model). (A) Mean slope efficiency as a function of disorder 
strength (measured in terms of the standard deviation of the on-site energy) 
for the topological laser (blue) and the trivial laser (red). The simulation 
includes 5% backscattering on each site. The slope efficiency of the 
topological laser is higher and more robust up to disorder levels on the order 
of the topological band gap. (B, C, E, and F) Typical lasing modes (colors 
indicate amplitude as indicated in Fig. 1) for the topological [(B) and (E)] and 
the trivial [(C) and (F)] cavities. The first and second rows show the CW 
and CCW components, respectively. In the topological array, the CW (B) and 
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INTRODUCTION: Physical systems that ex- 
hibit topological invariants are naturally en- 
dowed with robustness against perturbations, 
as was recently demonstrated in many settings 
in condensed matter, photonics, cold atoms, 
acoustics, and more. The most prominent man- 
ifestations of topological systems are topological 
insulators, which exhibit scatter-free edge-state 
transport, immune to perturbations and dis- 
order. Recent years have witnessed intense efforts 
toward exploiting these physical phenomena in 
the optical domain, with new ideas ranging from 
topology-driven unidirectional devices to topo- 
logical protection of path entanglement. But 
perhaps more technologically relevant than 
all topological photonic settings studied thus 
far is, as proposed by the accompanying theo- 
retical paper by Harari et al., an all-dielectric 
magnet-free topological insulator laser, with 
desirable properties stemming from the topo- 
logical transport of light in the laser cavity. 


RATIONALE: We demonstrate nonmagnetic 
topological insulator lasers. The topological 
properties of the laser system give rise to single- 
mode lasing, robustness against fabrication de- 
fects, and notably higher slope efficiencies 
compared to those of the topologically trivial 
counterparts. We further exploit the properties of 
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the active topological platform by assembling topo- 
logical insulator lasers from S-chiral microreso- 
nators that enforce predetermined unidirectional 
lasing even in the absence of magnetic fields. 


RESULTS: Our topological insulator laser sys- 
tem is an aperiodic array of 10 unit cell-by-10 
unit cell coupled ring resonators on an InGaAsP 
quantum wells platform. The active lattice uses 
the topological architecture suggested in the 
accompanying theoretical paper. This two- 
dimensional setting is composed of a square 
lattice of ring resonators coupled to each other 
by means of link rings. The intermediary links 
are judiciously spatially shifted to introduce a 
set of hopping phases, establishing a synthetic 
magnetic field and two topological band gaps. 
The gain in this laser system is provided by opti- 
cal pumping. To promote lasing of the topolog- 
ically protected edge modes, we pump the outer 
perimeter of the array while leaving the interior 
lossy. We find that this topological insulator laser 
operates in single mode even considerably above 
threshold, whereas the corresponding topologi- 
cally trivial realizations lase in multiple modes. 
Moreover, the topological laser displays a slope 
efficiency that is considerably higher than that in 
the corresponding trivial realizations. We further 
demonstrate the topological features of this laser 


by observing that in the topological array, all sites 
emit coherently at the same wavelength, whereas 
in the trivial array, lasing occurs in localized re- 
gions, each at a different frequency. Also, by 
pumping only part of the topological array, we 
demonstrate that the topological edge mode al- 
ways travels along the perimeter and emits light 


through the output coupler. 
By contrast, when we pump 
Read the full article _ the trivial array far from the 
at http://dx.doi. output coupler, no light is 
org/10.1126/ extracted from the coupler 
science.aar4005 because the lasing occurs at 


stationary modes. We also 
observe that, even in the presence of defects, the 
topological protection always leads to more effi- 
cient lasing compared to that of the trivial coun- 
terpart. Finally, to show the potential of this 
active system, we assemble a topological system 
based on S-chiral resonators, which can provide 
new avenues to control the topological features. 


CONCLUSION: We have experimentally dem- 
onstrated an all-dielectric topological insulator 
laser and found that the topological features en- 
hance the lasing performance of a two-dimensional 
array of microresonators, making them lase in 
unison in an extended topologically protected 
scatter-free edge mode. The observed single 
longitudinal-mode operation leads to a consid- 
erably higher slope efficiency as compared to 
that of a corresponding topologically trivial sys- 
tem. Our results pave the way toward a new 
class of active topological photonic devices, such 
as laser arrays, that can operate in a coherent 
fashion with high efficiencies. 
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Topological insulator laser. (A) Top-view image of the lasing pattern (topological edge mode) in a 10 unit cell-by—10 unit cell array of topologically 
coupled resonators and the output ports. (B) Output intensity versus pump intensity for a topological insulator laser and its trivial counterpart. 
The enhancement of the slope efficiency is about threefold. Comparing the power emitted in the single mode of the topological array to that of 

the highest power mode in the trivial array, the topological outperforms the trivial by more than a factor of 10. (C) Emission spectra from a topological 
insulator laser and its topologically trivial counterpart. au, arbitrary units. 
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Physical systems exhibiting topological invariants are naturally endowed with robustness 
against perturbations, as manifested in topological insulators—materials exhibiting robust 
electron transport, immune from scattering by defects and disorder. Recent years have 
witnessed intense efforts toward exploiting these phenomena in photonics. Here we 
demonstrate a nonmagnetic topological insulator laser system exhibiting topologically 
protected transport in the cavity. Its topological properties give rise to single-mode lasing, 
robustness against defects, and considerably higher slope efficiencies compared to the 
topologically trivial counterparts. We further exploit the properties of active topological 
platforms by assembling the system from S-chiral microresonators, enforcing predetermined 
unidirectional lasing without magnetic fields. This work paves the way toward active topological 
devices with exciting properties and functionalities. 


opological insulators are a phase of matter 
that feature an insulating bulk while support- 
ing conducting edge states (1-3). Notably, 
the transport of edge states in topological 
insulators is granted topological protec- 
tion, a property stemming from the underlying 
topological invariants (2). For example, in two- 
dimensional (2D) systems, the ensued one-way 
conduction along the edge of a topological in- 
sulator is, by nature, scatter free—a direct out- 
come of the nontrivial topology of the bulk 
electronic wave functions (3). Although topo- 
logical protection was initially encountered in 
the integer quantum Hall effect (4), the field 
of topological physics developed rapidly after 
it was recognized that topologically protected 
transport can also be observed even in the ab- 
sence of a magnetic field (5, 6). This, in turn, 
spurred a flurry of experimental activities in a 
number of electronic material systems (7). The 
promise of robust transport inspired studies 
in many and diverse fields beyond solid-state 
physics, such as optics, ultracold atomic gases, 
mechanics, and acoustics (8-25). Along these 
lines, unidirectional topological states were ob- 
served in microwave settings (9) in the pres- 
ence of a magnetic field (the electromagnetic 
analog of the quantum Hall effect), and, more 
recently, topologically protected transport phe- 
nomena have been successfully demonstrated in 
optical passive all-dielectric environments by 
introducing artificial gauge fields (14, 15). 
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In photonics, topological concepts could lead 
to new families of optical structures and devices 
by exploiting robust, scatter-free light propaga- 
tion. Lasers, in particular, could directly benefit 
from such attributes [see the accompanying theo- 
retical paper (26)]. In general, laser cavities are 
prone to disorder, which inevitably arises from 
fabrication imperfections, operational degrada- 
tion, and malfunction. Specifically, the presence 
of disorder in a laser gives rise to spatial light 
localization within the cavity, ultimately resulting 
in a degraded overlap of the lasing mode with the 


Fig. 1. Topological 
insulator laser: Lattice 
geometry. (A) Micro- 
scope image of an active 
InGaAsP topological 

10 unit cell-by—10 unit 
cell microresonator array. 
(B) Scanning electron 
microscopy (SEM) image 
of the outcoupling 
grating structures used 
to probe the array at 

the orange-outlined 
locations indicated in (A). 
(C) Magnification of 

the blue-outlined area 
indicated in (A), showing 
a SEM micrograph 

of a unit cell comprised 
of a primary ring site 
surrounded by four 
identical intermediary 
racetrack links. (D) A 
schematic of the 
topological array when 


gain profile. This implies lower output coupling, 
multimode lasing, and reduced slope efficiency. 
These issues become acute in arrays of coupled 
laser resonators (used to yield higher power), in 
which a large number of elements is involved. 
Naturally, it would be of interest to exploit topo- 
logical features in designing laser systems that 
are immune to disorder. In this spirit, several 
groups have recently studied edge-mode lasing 
in topological 1D Su-Schrieffer-Heeger resonator 
arrays (27-29). However, being one-dimensional, 
they lase in a zero-dimensional defect state, which 
inherently cannot provide protected transport. 
Conversely, 2D laser systems can directly benefit 
from topological protection. Indeed, it was shown 
theoretically that it is possible to harness the 
underlying features of topological insulators in 
2D laser arrays, when lasing in an extended topo- 
logical state (26, 30, 31). As indicated there, such 
systems can operate in a single-mode fashion 
with high slope efficiencies, in spite of appreciable 
disorder. In a following development, unidirec- 
tional edge-mode lasing was demonstrated in a 
topological photonic crystal configuration involv- 
ing a yttrium iron garnet (YIG) substrate under 
the action of a magnetic field (32). In that system, 
lasing occurred within a narrow spectral band 
gap induced by magneto-optic effects. Clearly, it 
would be of interest to pursue magnet-free ap- 
proaches that are, by nature, more compatible with 
fabrication procedures and photonic integration 
involving low-loss components. In addition, such 
all-dielectric systems can prove advantageous 
in substantially expanding the topological band 
gap and, in doing that, bring the topological 
protection of photon transport to the level at 
which lasing is immune to defects and disorder. 

Here we report the first observation of topolog- 
ically protected edge-mode lasing in nonmagnetic, 
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pumped along the perimeter to promote lasing of the topological edge mode. 
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2D topological cavity arrays. We show that this 
topological insulator laser can operate in single 
mode, even considerably above threshold, with 
a slope efficiency that is significantly higher than 
that achieved in their corresponding trivial reali- 
zations. Moreover, we observe experimentally that 
the topological protection leads to more efficient 
lasing in the 2D array than in the trivial counter- 
part, even in the presence of defects. Finally, to 
show the potential of this active system, we as- 
semble a topological system based on S-chiral 
resonators, which can provide new avenues to 
control the topological features. 


Design of the topological insulator laser 


We fabricated a 10 unit cell-by-10 unit cell 
coupled ring-resonator array on an active plat- 
form involving vertically stacked 30-nm-thick 
InGaAsP quantum wells [see (33), Fig. 1A]. We 
coupled the array to a waveguide that acts as 
an output coupler, which allowed us to inter- 
rogate the system using outcoupling gratings 
(Fig. 1B, corresponding to the yellow framed 
regions in Fig. 1A). The active lattice investigated 
here uses a topological architecture suggested in 
(26), which is based on adding gain and loss to 
the topological passive silicon platform dem- 
onstrated in (15). This 2D setting comprises a 
square lattice of ring resonators, which are 
coupled to each other through link rings (Fig. 1, 
A and C). The link rings are designed to be 
antiresonant to the main ring resonators. In 
this all-dielectric design, the intermediary links 
are judiciously spatially shifted along the y axis, 
with respect to the ring resonators, to introduce 
an asymmetric set of hopping phases. The phase 
shift is sequentially increased along the y axis in 
integer multiples of +270, designed here to be 
o. = 0.25, where a is proportional to the equivalent 
magnetic flux quanta passing through a unit cell. 
In this way, a round trip along any plaquette 
(consisting of four rings and four links) results in 
a total accumulated phase of +220, where the 
sign depends on the direction of the path along 
this unit cell. This provides the lattice with a syn- 
thetic magnetic field and establishes two topo- 
logically nontrivial band gaps. The cross section 
of each ring (500-nm width and 210-nm height) 
is designed to ensure single transverse-mode con- 
ditions at a wavelength of operation of 1550 nm 
(33). The nominal separation between the ring- 
resonators and off-resonant links is 150 nm, thus 
leading to two frequency band gaps, each having 
a width of 80 GHz (0.64 nm). The spectral size 
of the two band gaps was obtained by experimen- 
tally measuring the frequency splitting (0.8 nm) 
in a binary system of primary resonators, linked 
by means of an intermediate racetrack ring [see 
(33), part 9]. To promote protected edge-mode 
lasing, we optically pumped only the outer perim- 
eter of this array at 1064 nm with 10-ns pulses 
(Fig. 1D). This was achieved using a set of ap- 
propriate amplitude masks [see (33), part 1]. The 
intensity structure of the lasing modes was cap- 
tured by using an InGaAs infrared camera, and 
their spectral content was then analyzed by using 
a spectrometer with an array detector (33). In what 
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Fig. 2. Slope efficiencies and associated spectra of topological and trivial laser arrays. 

(A) Output intensity versus pump intensity for a 10 unit cell-by-10 unit cell topological array with 
a = 0.25 and its corresponding trivial counterpart (a = 0). In this experiment, the enhancement 

of the slope efficiency is about threefold. (B) Emission spectra from a trivial and a topological array 
when pumped at 23.5 kW/cm*. au, arbitrary units. (C and D) Evolution of the spectrum as a function 
of the pumping intensity for (C) topological and (D) trivial arrays. Single-mode, narrow-linewidth 


lasing in (C) is clearly evident. 


follows, we compare the features of the topological 
insulator lasers (o. = 0.25) with those of their trivial 
counterparts (a = 0) under various conditions. 


Studying the features of the topological 
insulator laser 


The edge mode can be made to lase by pumping 
the boundary of the topological array (26). In this 
case, a clear signature of topological lasing would 
be highly efficient single-mode emission, even at 
gain values high above the threshold. To observe 
these features in experiments, we pumped the 
perimeter of the topological and trivial arrays 
and measured the lasing output power (integrated 
over the two outcoupling gratings) and its spectral 
content. The light-light curves measured for the 
topological and the trivial arrays (Fig. 2A) clearly 
show that the topological system lases with a 
higher efficiency than its trivial counterpart. From 
their measured spectra (Fig. 2, B and D), we 
observe that the topological arrays remain single 
moded over a wide range of pumping densities 
(Fig. 2C), whereas the trivial arrays (tested over 
multiple samples) always emit in multiple wave- 
lengths with considerably broader linewidths 
(Fig. 2D). Importantly, if we only compare the 
power emitted in the dominant (longitudinal) 
mode of the topological array to the mode with 


16 March 2018 


the highest power in the trivial array, the topo- 
logical laser outperforms the trivial one by more 
than an order of magnitude. This difference in 
performance is attributed to the physical prop- 
erties of the topological edge modes. The trivial 
array suffers from several drawbacks. First, the 
trivial lasing modes extend into the lossy bulk, 
thus experiencing suppressed emission. Second, 
the trivial lasing modes try to avoid the output 
coupler so as to optimize their gain. And finally, 
because of intrinsic disorder in fabrication, the 
lasing mode localizes in several different parts 
of the trivial lattice, each lasing at a different 
frequency, thereby giving rise to a multimode 
behavior. Conversely, apart from a weak expo- 
nential penetration into the bulk, in topological 
arrays the edge states are strongly confined to 
the edge. Moreover, because they are forced to 
flow around the perimeter, they are always in 
contact with the output coupler. Finally, because 
of its inherent topological properties, the lasing 
edge mode does not suffer from localization, and 
therefore it uniformly extends around the perim- 
eter (in a single mode), using all the available 
gain in the system by suppressing any other 
parasitic mode. 

To demonstrate that these active lattices ex- 
hibit topological features, we compared their 
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Fig. 3. Lasing characteristics of topological lattices versus those of 
their corresponding trivial counterparts under several pumping 
conditions. (A to D) Trivial and (E to H) topological lattices. Lasing in a 
(B) trivial and (F) topological array when their full perimeter is selectively 
pumped. (A) and (E) represent the spectral content as obtained from 
specific edge sites of the arrays depicted in (B) and (F), correspondingly. 
Notice that the topological lattice remains single moded, whereas the 
trivial one emits in multiple modes. Lasing transport in a (D) trivial 

array and (H) topological lattice when the right side is pumped. The lasing 
edge mode in (H) travels (blue arrow) along the unpumped perimeter 


and exits mostly through the left output grating, whereas the lasing 
mode in (D) never reaches the output coupler. (C) and (G) present 

similar results when the pumping region at the right side is further 
extended into the array. In (G), the lasing edge mode again 

travels all the way to the output port, whereas the lasing modes 

of the trivial lattice never reach the extracting ports in (C). This 

proves that the topological mode travels around the perimeter and 
always reaches the output port, whereas the lasing modes of the 

trivial lattice are stationary. The pumping conditions are shown above the 


lasing response against that of their trivial 
counterparts (o = 0) when their periphery is 
pumped. The emission intensity profiles ob- 
tained from these two systems are shown in 
Fig. 3, B and C. To check whether the lasing 
modes are extended or localized around the perim- 
eter of the lattice, we measured the spectrum 
of the light emitted from different sites around 
the arrays (Fig. 3, A and E). For the trivial array, 
we observed that the spectrum varies around the 
lattice, with emission occurring over a wide wave- 
length range, spanning from 1543 to 1570 nm, as 
shown in Fig. 3A. This is an indication that the 
trivial array lases in localized domains, each one 
at a different frequency. By sharp contrast, in the 
topological array, all sites emit coherently at the 
same wavelength (Fig. 3E). Such lasing, in a single 
extended topological edge mode, is a direct mani- 
festation of topologically protected transport. These 
results are consistent with those presented in Fig. 2. 
The full spectra of Fig. 3, A and E, are given in (33). 
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labeled panels. 


Fig. 4. Robust behavior of the lasing edge 
mode with respect to defects in a 
topological array. (A and B) Lasing response 
of a (A) topological and (B) trivial array in the 
presence of two defects intentionally inserted 
on the periphery, under the same pumping 
intensity. Note that the edge mode transport in 
(A) (blue arrow) bypasses the defects, 
whereas in its trivial counterpart, the lasing 
occurs from the three separate sections. 


Topological transport in these structures was 
further investigated by selectively pumping the 
lattice. First, we pumped only one edge of the 2D 
array, as depicted in Fig. 3D (inset). Under these 
conditions, the lasing mode in the trivial system is 
confined to the pumped region (Fig. 3D). In this 
arrangement, the emission is heavily suppressed 
both in the bulk as well as along the perimeter, 
and, consequently, no light is extracted from any 
of the output grating couplers. By contrast, for 
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the topological array, even when only one side is 
pumped, the edge mode flows along the periph- 
ery, finally reaching the output coupler, as shown 
in Fig. 3H. In this case, only one output coupler 
grating emits strongly. This indicates that the lasing 
mode that reaches the output coupler has a definite 
chirality in each ring. Given that the emission is in a 
single mode, one can conclude that lasing takes 
place in only one topological mode. To show that in 
the topological case, it is the edge mode that lases, 
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Fig. 5. Topological active array involving chiral S-microresonator 
elements. (A) SEM image of a 10 unit cell-by-10 unit cell topological 
array. The primary resonators feature an internal S-bend for enforcing, 
in this case, a right spin, whereas the intermediate link design is the 
same as in Fig. 1C. (B) A magnification of the SEM micrograph in (A) of 
the basic elements involved. (C) Field distribution in an individual 


whereas in the trivial case the bulk states lase, we 
expanded the pumping region at the right edge 
(Fig. 3C, inset). In the trivial case, even though 
pumping over a larger area is now provided, still 
no laser light reaches the output ports (Fig. 3C). 
This means that there are no traveling edge 
modes that could reach the output ports. Con- 
versely, for the topological array, the lasing edge 
mode reaches the output coupler with a fixed 
chirality within each ring (Fig. 3G). This shows 
that the topological lasing mode extends around 
the perimeter and travels all the way to the 
output ports, whereas the lasing in the trivial 
case occurs in stationary localized modes. In this 
vein, we tested multiple samples and found that 
the same features consistently emerged in a 
number of different designs (different resonance 
frequencies, couplings, etc.) in a universal manner. 
A video showing the behaviors of partially pumped 
topological and trivial arrays are provided as 
supplementary materials [see (33), part 10] and 
more features associated with these arrays can 
be found in [(33), parts 3 and 5]. 


Introducing defects 


Next, we studied lasing in a topological structure 
and in its trivial counterpart, in the presence of 
defects, which are intentionally introduced into 
the structures. We removed specific microrings 
along the perimeter, where pumping is provided 
(i.e., we remove two gain elements). Figure 4 
shows the light emission from these two types 
of structures. These results demonstrate that in 
a topological system (Fig. 4A), light is capable of 
bypassing the defects by penetrating into the 
bulk and displaying lasing in an extended edge 
mode of almost uniform intensity. Conversely, 
the intensity of the emitted light in the trivial 
structure is considerably suppressed (Fig. 4B), 
and the defects effectively subdivide the perim- 
eter into separate regions that lase indepen- 
dently (both measurements were performed at 
identical pump power levels). Hence, the topo- 
logical insulator laser is robust against defects, 
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the extracting ports. 


even when introduced into the gain regions. [Fur- 
ther evidence of this robustness in the presence 
of disorder is given in (33), part 8.] 


Laser based on an array of chiral 
S-bend elements 


Having established the underlying concepts of 
the topological insulator laser and the promise 
it holds for exploring new aspects of topological 
physics, specifically in active media, it is interest- 
ing to discuss some of the directions to which 
these ideas can lead. As an example, embarking 
on fundamental aspects such as Maxwell’s reci- 
procity in the presence of nonlinearity, as well 
as on potential applications, we modified the 
individual resonators in the topological insula- 
tor laser so as to break the symmetry between 
the clockwise (CW) and counterclockwise (CCW) 
modes in each ring. Instead of using conven- 
tional rings, we used a special S-bend design (34) 
for each primary cavity element in the topological 
lattice (Fig. 5, A and B). The intermediary links 
remained the same as in the previous designs 
(Fig. 1A). In this system, each laser microresonator 
selectively operates in a single spinlike manner, 
i.e., in either the CW or the CCW direction, by 
exploiting gain saturation and energy recircula- 
tion among these modes. The S-chiral elements 
involved, in the presence of nonlinearity (gain 
saturation) and the spatial asymmetry of the S- 
bends, add unidirectionality to the topological 
protection of transport. In the experiments de- 
scribed in Fig. 5, we observe suppression of 
more than 12 dB between the right- and left-hand 
spins in each resonator [see (33), part 6]. Finite- 
difference time-domain simulations also indicate 
that the differential photon lifetime between the 
right and left spins in these S-bend cavities is 
about 3 ps, corresponding to an equivalent loss 
coefficient of 10 cm™ [more details are outlined 
in (33), part 7]. The field distribution in the prev- 
alent spinning mode in these active S-resonators 
is shown in Fig. 5C, featuring a high degree of 
power recirculation through the S-structure that 
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S-element, as obtained from finite-difference time-domain simulations. 
(D) Measured intensity profile associated with the lasing edge mode 

in a topological array with a = 0.25. In this system, the perimeter is 
selectively pumped, and the energy in each ring circulates in a 
counterclockwise manner, as indicated by the radiation emerging from 


is responsible for the spinlike mode discrimina- 
tion. The corresponding intensity profile as- 
sociated with this unidirectional edge-mode 
energy transport is shown in Fig. 5D. In this case, 
energy is predominantly extracted from only 
one of the two outcoupling gratings (with a 
~10-dB rejection ratio), which is an indication of 
unidirectional energy flow in the rings of this 
topological array. 


Discussion 


Our all-dielectric topological insulator laser ex- 
ploits its topological features to enhance the 
lasing performance of a 2D array of micro- 
resonators, making them lase in unison in an 
extended topologically protected scatter-free edge 
mode. The observed single longitudinal-mode 
operation leads to a considerably higher slope 
efficiency as compared to a corresponding topo- 
logically trivial system. The systems described 
here are based on contemporary fabrication 
technologies of semiconductor lasers, without 
need for magnetic units of exotic materials. Our 
results provide a route for developing a new 
class of active topological photonic devices, 
especially arrays of semiconductor lasers, that 
can operate in a coherent fashion with high 
slope efficiencies. 
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CIRCADIAN RHYTHMS 


Diurnal transcriptome atlas of 
a primate across major neural 
and peripheral tissues 


Ludovic S. Mure, Hiep D. Le, Giorgia Benegiamo, Max W. Chang, Luis Rios, 
Ngalla Jillani, Maina Ngotho, Thomas Kariuki, Ouria Dkhissi-Benyahya, 


Howard M. Cooper,* Satchidananda Panda* 


INTRODUCTION: The interaction among cell- 
autonomous circadian oscillators—daily cycles 
of activity-rest and feeding-fasting—produces 
diurnal rhythms in gene expression in almost 
all animal tissues. These rhythms control the 
timing of a wide range of functions across dif- 
ferent organs and brain regions, affording 
optimal fitness. Chronic disruption of these 
rhythms predisposes to and are hallmarks 
of numerous diseases and affective disorders. 


RATIONALE: Time-series gene expression 
studies in a limited number of tissues from 
rodents have shown that 10 to 40% of the ge- 
nome exhibits a ~24-hour rhythm in expression 
in a tissue-specific manner. However, rhythmic 
expression data from diverse tissues and brain 
regions from humans or our closest primate 
relatives is rare. Such multitissue diurnal gene 
expression data are necessary for gaining mech- 
anistic understanding of how spatiotemporal 
orchestration of gene expression maintains 
normal physiology and behavior. We used a 


RNA sequencing technique to assess gene ex- 
pression in major tissues and brain regions 
from baboons (a primate closely related to hu- 
mans) housed under a defined 24-hour light- 
dark and feeding-fasting schedule. 


RESULTS: We assessed gene expression in 64 
different tissues and brain regions of male ba- 
boons, collected every 2 hours over the 24-hour 
day. Tissue-specific transcriptomes in baboon 
were comparable with that from humans (Hu- 
man GTEx data set). We detected >25,000 ex- 
pressed transcripts, including protein-coding 
and -noncoding RNAs. Nearly 11,000 genes were 
commonly expressed in all tissues. These univer- 
sally expressed genes (UEGs) encoded for basic 
cellular functions such as transcription, RNA 
processing, DNA repair, protein homeostasis, 
and cellular metabolism. The remainders were 
expressed in distinct sets of tissues, with ~1500 
genes expressed exclusively in a single tissue. 
Rhythmic transcripts were found in all tis- 
sues, but the number of cycling transcripts varied 


from ~200 to >3000 in a given tissue, with only 
limited overlap in the repertoire of rhythmic 
transcripts between tissues. Of the 11,000 UEGs, 
the vast majority (96.6%) showed 24-hour rhyth- 
micity in at least one tissue. A majority (>80%) 
of the 18,000 protein-coding genes detected also 
exhibited 24-hour rhythms in expression. The 

most enriched rhythmic 
transcripts across tissues 
Read the full article Were core clock compo- 
at http://dx.doi. nents and their immediate 
org/10.1126/ output targets. However, 
science.aao0318 their relative abundance 
and robustness of daily 
rhythms varied across tissues. Considered at 
the organismal level, global rhythmic transcrip- 
tion in 64 tissues organized into bursts of peak 
transcription, during early morning and late 
afternoon (when 11,000 transcripts reach their 
peak level). By contrast, during a relative “qui- 
escent phase” in early evening that coincides 
with the onset of sleep and no food intake, only 
700 rhythmic transcripts reach their peak ex- 
pression level. 


CONCLUSION: The daily expression rhythms 
in >80% of protein-coding genes, encoding di- 
verse biochemical and cellular functions, con- 
stitutes by far the largest regulatory mechanism 
that integrates diverse biochemical functions 
within and across cell types. From a translational 
point of view, rhythmicity may have a major 
impact in health because 82.2% of genes coding 
for proteins that are identified as druggable tar- 
gets by the U.S. Food and Drug Administration 
show cyclic changes in transcription. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: howard.cooper@inserm.fr 
(H.M.C.); satchin@salk.edu (S.P.) 
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Spatiotemporal gene expression atlas of a primate. (Left) Gene expression analysis across 64 tissues of a diurnal primate sampled over the 
24-hour day shows that 82% of protein-coding genes are rhythmic in at least one tissue. (Right) Rhythmic expression is tissue-specific and 
confers an additional layer of regulation and identity to the transcriptome of a given tissue. 
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CIRCADIAN RHYTHMS 


Diurnal transcriptome atlas of 
a primate across major neural 
and peripheral tissues 


Ludovic S. Mure,’ Hiep D. Le,’ Giorgia Benegiamo,’ Max W. Chang,” Luis Rios," 
Ngalla Jillani,? Maina Ngotho,” Thomas Kariuki,® Ouria Dkhissi-Benyahya,* 


Howard M. Cooper,** Satchidananda Panda’* 


Diurnal gene expression patterns underlie time-of-the-day—specific functional 
specialization of tissues. However, available circadian gene expression atlases of a few 
organs are largely from nocturnal vertebrates. We report the diurnal transcriptome of 
64 tissues, including 22 brain regions, sampled every 2 hours over 24 hours, from 

the primate Papio anubis (baboon). Genomic transcription was highly rhythmic, with up 
to 81.7% of protein-coding genes showing daily rhythms in expression. In addition to 
tissue-specific gene expression, the rhythmic transcriptome imparts another layer 

of functional specialization. Most ubiquitously expressed genes that participate in essential 
cellular functions exhibit rhythmic expression in a tissue-specific manner. The peak phases 
of rhythmic gene expression clustered around dawn and dusk, with a “quiescent period” 
during early night. Our findings also unveil a different temporal organization of central and 
peripheral tissues between diurnal and nocturnal animals. 


aily cycles in behavior, physiology, and 

metabolism are presumed to arise from 

coordinated gene expression rhythms in 

multiple tissues and brain regions. Gene 

expression rhythms have been cataloged 
in a handful of tissues in model organisms (J-3). 
Among these, the mouse, which is nocturnal and 
evolutionarily diverged from humans 75 million 
years ago, is considered as a representative model 
for mammalian gene expression rhythms. How- 
ever, these data sets are primarily collected from 
the laboratory mouse (Mus musculus), which is 
active mainly at night, generally fed ad libitum, 
and has fragmented daytime sleep. Although stan- 
dard laboratory mouse strains do not produce 
melatonin, wild mice secrete both melatonin and 
cortisol during the night time. By contrast, diurnal 
primates, including humans, are active and eat 
discrete meals during the daytime, produce mel- 
atonin at night during their phase of consolidated 
sleep, and show a daytime peak in corticosteroids 
(4). Despite these differences in timing of behav- 
ior, a study of deoxy-glucose uptake has led to 
the view that the phase of the master circadian 
oscillator in the suprachiasmatic nucleus (SCN) 
is similar in nocturnal rodents and diurnal pri- 


'Regulatory Biology Laboratory, Salk Institute for Biological 
Studies, 10010, North Torrey Pines Road, La Jolla, CA 
92037, USA. “Department of Medicine, University of 
California, San Diego, 9500 Gilman Drive, La Jolla, CA 
92093, USA. “Institute of Primate Research (IPR), National 
Museums of Kenya, Nairobi, Kenya. “Université Lyon, 
Universite Claude Bernard Lyon 1, Inserm, Stem Cell and 
Brain Research Institute U1208, 69500 Bron, France. 
*Corresponding author. Email: howard.cooper@inserm.fr (H.M.C.); 
satchin@salk.edu (S.P.) 


Mure et al., Science 359, eaao0318 (2018) 


mates (5). However, limited gene expression data 
from human blood (6) and post mortem cortex 
samples (7) have indicated that expression, phase, 
and amplitude of daily gene expression rhythms 
in primates differ from that in mice. We there- 
fore analyzed diurnal gene expression profiles of 
768 samples across 64 different central and pe- 
ripheral tissues (Fig. 1 and table S1) from young 
male baboons (Papio anubis), a close primate rela- 
tive of humans (diverged 24 million years ago). 


Transcriptional complexity across tissues 


Animals housed individually in seminatural con- 
ditions under a 12 hours light-12 hours dark cycle 
showed highly stable and synchronous phases 
of activity, body temperature, and cortisol con- 
centration in the blood across the 24-hour cycle 
(Fig. 2A and fig. S1, A to E). Tissues were collected 
every 2 hours across the 24-hour day (ZTO, -2, 
-4,, -6, -8, -10, -12, -14, -16, -18, -20, and -22, where 
ZT0 is the time when light is ON and ZT12 is 
when light is OFF) and were flash frozen within 
2 hours of collection. Poly A* RNA” sequencing 
performed on an Illumina platform (8) produced 
an average of 15.4 million 50-base pair single-end 
(SE50) mapped reads per sample (table S2). The 
sequence reads were mapped to the P. anubis 
genome data set (PapAnu2.0, International Nu- 
cleotide Sequence Database Collaboration Assem- 
bly, GenBank Assembly ID GCA_000264685, 
version 88) and annotated with Ensembl (Full 
genebuild, released July 2014). This ensemble an- 
notation comprises 19,210 protein-coding genes 
(PCGs), 9272 noncoding RNAs [including 5888 
small nuclear RNA (snRNA), 1669 long non- 


16 March 2018 


coding RNA (IncRNA), and 1715 miscellaneous 
RNA (miscRNA)] and 720 pseudogenes. At a 
threshold of 0.1 FPKM (fragments per kilobase 
of transcript per million mapped reads) for gene 
expression (9), 97.3% of PCGs (18,684) and 89.6% 
of IncRNAs (1495) were detected in at least one 
organ (Fig. 2B). Principal component analyses 
of transcriptome from all 768 samples revealed 
multidimensional clustering or hierarchical clus- 
tering of samples from similar tissues and tissue 
types (Fig. 2E and fig. S2, A and B). Overall, the 
relative distance between tissue clusters was sim- 
ilar to that observed for comparable samples in 
the Nonhuman Primate Reference Transcriptome 
Resource database (13 tissues from one female 
baboon) (fig. S2C) (10) and in 41 human tissue 
transcriptomes (9, 11). Although brain regions 
showed little separation in multidimensional 
clustering, hierarchical clustering of these tis- 
sues revealed grouping of distinct clusters of eye, 
basal ganglia, hypothalamus, thalamus (THA), 
and cortices (fig. S2B). The unbiased cluster- 
ing of ontologically related organs and brain 
nuclei validated dissection, sampling, and analy- 
ses protocols. 

Out of the 25,098 transcripts detected in at 
least one organ, 10,989 transcripts were common 
to all 64 tissues. These ubiquitously expressed 
genes (UEGs) encode gene products necessary 
for basic cellular functions, including basic tran- 
scription, RNA processing, DNA repair, protein 
homeostasis, and secretory function (table S3). 
The remaining 14,109 genes were expressed in 
distinct sets of tissues, with 1473 genes expressed 
exclusively in one tissue (table S4 and fig. S2D). 
Tissues differed in the number of expressed genes; 
the top quartile included highly heterogeneous 
tissues such as the arcuate nucleus (ARC), supra 
optic nucleus (SON), ventromedial regions of the 
hypothalamus (VMH), and the testis (TES). Each 
of these tissues expressed >17,000 genes, where- 
as the bottom quartile included relatively fewer 
heterogeneous tissues such as the abdominal 
and gastrocnemian muscles (MUA and MUG, 
respectively) and expressed 14,000 to 16,000 
genes (table S1). The number of expressed genes 
does not entirely represent the complexity of 
expression. Indeed, transcriptional complexity, 
expressed as the fraction of total RNAs accounted 
for by the 10 or 100 most frequently expressed 
genes (table S5) (9), showed wide distribution. 
The pancreas (PAN) was the least complex tissue 
(top 10 expressed genes, accounting for >60% of 
all sequenced tags), whereas the TES transcrip- 
tional output was the most complex (Fig. 2D). 
Although several brain regions are among the 
top quartile in terms of number of expressed 
genes, they showed a relatively low complexity 
of gene expression. In 16 out of 24 brain regions, 
the 10 most highly expressed genes accounted 
for >50% of all genes expressed (Fig. 2D). Thus, 
transcriptome signature confers tissue identity 
in this primate as it does in humans (9, J7). 


Diurnal gene expression across tissues 


Diurnal rhythms of gene expression from the 
12 time points for all tissues were analyzed with 
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Fig. 1. Tissue collection. List of the 64 tissues collected, according to system and functional types 


(including 22 brain regions and nuclei). 


MetaCycle (12), an R package that integrates 
multiple algorithms to determine periodicity and 
rhythmicity. Transcripts with ~24-hour rhythms 
are discussed further. All 64 tissues examined 
showed rhythmic genes (Fig. 3A and table S6), 
but there was a clear difference between tissue- 
specific gene expression and tissue-specific rhyth- 
mic gene expression. In general, the number of 
rhythmic transcripts in a given tissue was not 
correlated to the number of genes expressed 
in the same tissue (fig. S3A). Furthermore, the 
fraction of the transcriptome detected as rhyth- 
mic and the extent of rhythmic genes shared 
among tissues were more complex than spatial 
gene expression signatures of tissues (Fig. 3C 
and table S7). In thyroid (THR), stomach fundus 
(STF), gastrocnemian muscle (MUG), paraven- 
tricular nucleus (PVN), and prefrontal cortex (PRC), 
more than 3000 genes were detected to cycle, 
whereas in the pineal (PIN), mesenteric lymphon- 
odes (MEL), supraoptic nucleus (SON), lateral hy- 
pothalamus (LH), and bone marrow (BOM), less 
than 200 genes had 24-hour rhythms of expres- 
sion (Fig. 3, A and B). More than 50% of genes 
detected in a given tissue were also expressed in 
all 64 tissues. However, even among tissues with 
>1000 rhythmic genes, less than 1% (<10) rhyth- 
mic genes were shared. Becuse of this limited 
overlap of rhythmic genes between tissues, and 
even though <4000 genes cycle in most tissues, 
16,442 genes (65.5% of all expressed genes) (table 
S8) were rhythmic, with a ~24-hour period in at 
least one tissue (Fig. 3, A and C). Rhythmic tran- 
scripts were particularly enriched in PCGs (fig. S3, 
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B and C), accounting for 81.7% of expressed 
PCGs (15,269 of 18,684). Thus, as seen in rodents 
(1, 13, 14) and insects (2), rhythmic transcription 
is tissue-specific. More importantly, more than 
two-thirds of the transcriptome showed ~24-hour 
rhythms in expression. 

Tissue-specific rhythmic expression is not a 
function of tissue-specific gene expression. Sur- 
prisingly, in nearly all tissues, the majority of 
cycling genes were UEGs (76.7% on average) 
(Fig. 3A). Of the total 10,989 UEGs, the vast ma- 
jority (10,607 genes, 97%) cycled in at least one 
organ (Fig. 3D), leaving only 382 UEGs not de- 
tected as rhythmic in any of the 64: tissues (table 
89). Comparison of the number of cycling and 
noncycling UEGs as a function of the number of 
tissues analyzed indicated that with an increased 
sampling of cell and tissue types, all UEGs may 
indeed cycle in at least one organ (Fig. 3D). This 
implies that although the rhythmic transcriptome 
is tissue-specific, the majority of rhythmic genes 
is still expressed in each tissue but may not cycle 
in other tissues. This suggests that rhythmic genes 
undergo tissue-specific transcriptional regulation. 
Because UEGs participate in fundamental cell 
biological processes, circadian regulation of sub- 
sets of these genes in different organs may re- 
flect a mechanism for organ-specific modulation 
of basic cellular function. For example, in endo- 
crine organs, the diurnal regulation of exocyto- 
sis specifically alters basic cellular functions to 
produce rhythmic release of endocrine factors. 

Because MetaCycle detects rhythmic genes 
regardless of transcript abundancy (12), we also 
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examined the relation between gene rhythmic- 
ity and amount of expression. By calculating the 
fraction of rhythmic genes in each decile of av- 
erage expression values, we found that in 60% 
of the tissues, the most rhythmic genes are among 
the genes that account for the top three deciles 
of transcriptional output (the 30% most expressed 
genes), whereas less than 1% of rhythmic genes 
were found in the decile of the least expressed 
genes (Fig. 3E). Such rhythmic expression of high- 
ly expressed genes supports the hypothesis that 
rhythmic transcription reduces or optimizes en- 
ergy expenditure in gene expression (15). 

We analyzed the times of peak expression 
(phase) of rhythmic genes. The phases in each 
tissue were not randomly distributed through- 
out the 24-hour day. The peak phases of expres- 
sion of rhythmic genes for nearly all organs were 
largely clustered in one or two narrow temporal 
windows (of <6 hours) (Fig. 4). In tissues such as 
amygdala (AMY), habenula (HAB), optic nerve 
head (ONH), PVN, SCN, VMH, LH, lateral globus 
pallidus (LGP), substantia nigra (SUN), PAN, 
PRC, prostate (PRO), and spleen (SPL), more than 
two-thirds of rhythmic genes peaked within a 
narrow (<6 hour) interval, whereas some tissues— 
including adrenal gland cortex (ADC), aorta (AOR), 
cerebellum (CER), dorsomedial hypothalamus 
(DMH), THA, and putamen (PUT)—had genes 
that peaked within two distinct time intervals. 
However, even for anatomically adjacent tis- 
sues these phase clusters were temporally dis- 
tinct. For example, in the kidney cortex (KIC) 
the main phase cluster was at ZT2, whereas in 
the medulla (KIM), it appeared at ZT7. Simi- 
larly, along the digestive track the phase clus- 
ters of esophagus (OES), STF, antrum (ANT), 
duodenum (DUO), ileum (ILE), ascending colon 
(ASC), and descending colon (DSC) were dis- 
tinct (Fig. 4). A compilation of peak phases of 
expression across all rhythmic genes in all tis- 
sues revealed two major peaks, early afternoon 
(11,146 genes peak at ZT8) and late night or 
dawn (5127 genes peak at ZT22), with a distinct 
trough around midnight (696 genes peak at 
Z7116) (Fig. 3F and fig. S3D). This relative “qui- 
escent zone” in the first half of the night (ZT14 to 
-19) is thus a distinct feature of rhythmic tran- 
scriptional output in the diurnal primate. 


Tissue-specific features of the 
molecular clock 


Because the number of common rhythmic genes 
between tissues is rather limited, we examined 
which genes were most widely detected as rhyth- 
mic in multiple tissues. No single transcript was 
scored significantly rhythmic in all 64 tissues. 
Genes encoding for the core circadian clock com- 
ponents and their immediate output targets were 
among the genes detected as rhythmic in most 
(but not all) tissues (Fig. 5A and table S10). The 
recently described circadian repressor gene Ciart 
[Circadian-associated transcriptional repressor 
(or Chrono)] was detected cycling in the most 
number of tissues (52 tissues) (Fig. 5A and fig 
S4A). For the 13 transcriptional regulators con- 
sidered canonical components of the circadian 
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clock (Perl-3, Cryl-2, Clock, Bmall, Npas2, Rora, 
Rorb, Rorc and Nridi-2), their relative abundance 
and robustness of daily rhythms varied across 
tissues. Of the 13 components, 11 were detected 
in all 64 tissues, whereas Rorb and Rorc were 
expressed in 60 and 62 tissues, respectively. The 
median expression of clock components also 
varied across tissues, with paralog pairs (Cry1 
and Cry2; NridI and Nrid2) exhibiting mutually 
complementary expression in different tissues 
(Fig. 5B and fig. S4B). Within the hypothalamus, 
the SCN, PVN, preoptic area (PRA), and DMH 
showed a distinctive feature: Both Cry and Nrid 


paralogous pairs were expressed in equivalent 
amounts (Fig. 5B), whereas in most other brain 
regions and organs, one of the members was 
expressed in higher amounts. Although core 
clock genes did not oscillate in all tissues, nine 
of the clock components showed daily rhythms in 
more than 20 tissues (Fig. 5A). Furthermore, the 
amplitude of oscillation of core clock compo- 
nents varied between tissues (Fig. 5B). The het- 
erogeneity in the overall transcript abundance 
and rhythmicity of expression of circadian clock 
components implies different composition of 
core activators, repressors, and modulators in 
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Fig. 2. Transcriptome complexity. (A) Timing of sampling (gray arrows) across the 24-hour 

cycle (dark phase in gray shading) and timing of meals (green arrows) superimposed on the 
daily cycles of locomotor activity (black), body temperature (blue) (averages for 12 animals, + 95% 
confidence interval, over the 14 days preceding the sample collection, average activity and 
temperature periods respectively 23.98 + 0.02 and 23.97 + 0.07 hours), and cortisone (plasma, 
solid red line; cerebrospinal fluid, dotted red line). (B) Number of genes expressed (average FPKM 
of the 12 time points >0.1) in at least one of the 64 tissues (25,098, left) or present in all the 
tissues sampled (UEG, 10,989, right). Different colors indicate the gene types. (C) Principal 
component analysis performed on the 12 time points of 36 representative tissues shows clustering 
of similar tissues groups based on gene expression profiles. The 22 tissues constituting the brain 
group clustered very tightly apart from the peripheral organs. (D) Cumulative distribution of the 
average fraction of total transcription contributed by genes when sorted from most to least 
expressed in each tissue (x axis). In all figures, each tissue is color-coded according to its system 


or functional group presented in Fig. 1. 
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different tissues, generating tissue-specific char- 
acteristics of core oscillators and input and out- 
put mechanisms. 

Although the transcript abundance and rhyth- 
mic properties of clock gene expression varied 
across tissues, the peak phase of expression of 
each clock component was mostly restricted 
to a roughly 6-hour window (Fig. 5C, fig. S4C, 
and table S8). The mean peak times of expres- 
sion for Bmall and Npas2 were similar (Bmall, 
Z1T13.3 + 0.2; NPAS2, ZT14.3 + 0.5). Although 
Bmall (heterodimerized with CLOCK or NPAS2) 
is considered to activate Per, Cry, Nrid, and Ror 
expression, the peak phases of expression of 
these targets were not identical. Perl, Per2, and 
Cry2 shared similar peak phases of expression 
(ZT2.4, -2.9, and -2.5, respectively), whereas Cry1 
peaked around ZT11. Nridi and Nrid2 peaked 
around ZT22 to ZT23, and Rorc peaked around 
ZT8 (fig. S4D and table S11). Although differ- 
ent combinations of paralogous clock compo- 
nents and their abundance provide evidence 
for tissue-specific characteristics of the core os- 
cillator, the relative phases of expression of these 
components are remarkably conserved across 
tissues. 


Basic cellular functions are rhythmic 


Because tissues differed in the number of rhyth- 
mic genes and shared only a limited number of 
them (Fig. 3, B and C), even in terms of the cir- 
cadian clock components (Fig. 5A), it is likely 
that different components of a given metabolic 
or signaling pathway were rhythmic in different 
tissues. To examine this, we used Gene Ontol- 
ogy (GO)-based annotation of rhythmic genes 
in different tissue types (J6) and compared the 
ontology terms that were significantly overrep- 
resented in at least 10 organs. As expected, among 
the 32 Kyoto Encyclopedia of Genes and Genomes 
(KEGG) biochemical pathways shared by at least 
10 organs, circadian rhythmicity was the most 
frequently overrepresented. Other highly over- 
represented pathways were those implicated in 
DNA replication, DNA repair, protein ubiquitina- 
tion, the mTOR (mechanistic target of rapamycin) 
pathway, lysosomal function, amino-sugar me- 
tabolism, pyruvate metabolism, and oxidative 
phosphorylation (Fig. 6A and fig. $5). Overall, 
basic energy metabolism pathways and path- 
ways that ensure the integrity of DNA, protein, 
and other macromolecules showed daily rhythms 
in multiple tissues. Phase set enrichment analysis 
(PSEA) (77) allowed us to calculate the peak phase 
of various ontology groups. Although DNA dam- 
age repair and lysosomal function groups showed 
peak expression during the afternoon (Fig. 6B), 
other functional groups were relatively more 
dispersed. Genes in the mTOR pathway peaked 
mostly when animals ate, whereas the protein 
ubiquitination pathways components were ex- 
pressed before those of the mTOR pathway and 
were mainly active between midnight and noon. 
Annotation along cellular compartments also 
complemented the pathway-centric annota- 
tion. Specifically, cellular transcripts that encode 
components participating in protein synthesis, 
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protein degradation, autophagy, and mitochon- | lation of the secretory pathway is more impor- | muscle (MUS), AOR, white adipose tissue (WAT), 


drial function commonly showed daily varia- tant for earlier stages of secretory machinery. CER, brainstem (BST), pituitary (PIT), and SCN]. 
tion in abundance (Fig. 6C). Golgi-related GO : These data sets are from a young male C57BL/6 
terms were enriched within the cellular compart- | Comparative rhythmic transcriptomes mouse strain fed a standard laboratory diet ad 
ments rhythmic in the most number of tissues. In between mouse and baboon libitum. Although these data were collected from 
particular, components of the cis side of the Golgi | The data set allowed comparison between di- | mice that were released into constant darkness 
apparatus, such as the endoplasmic reticulum- | urnal baboon and nocturnal mouse in the phases | for 1 to 2 days before tissue collection, the phase 


Golgi intermediate compartment, were the most | of circadian clock components and rhythmic | of expression of clock genes under such condi- 
represented among the most widely regulated (Fig. | transcriptome in the SCN and other tissues. We | tions deviate <1 hour from that observed under 
6D). By contrast, tissues showing daily rhythms selected 11 tissue data sets from rodents (J, 78) a normal light-dark cycle (13). We compared the 
in components of the distal Golgi complexes | for which we obtained comparable data sets in | phase of the core clock components in baboons 
were relatively few in number (Fig. 6D), which the baboon [including liver (LIV), lungs (LUN), and mice and found that the peak phase of ex- 
possibly reflects the fact that rhythmic regu- heart (HEA), kidney (KID), adrenal gland (ADR), pression of each clock component was shifted 
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by ~12 hours between mouse and baboon tis- | morning and evening, respectively. The peak 
sues (with the notable exception of the SCN). phase of expression of other clock components 
This is evident for Bmall and Per! (Fig. 7, A | similarly showed opposite phases in most organs. 
and B), which showed peaks of expression in | However, the expression of Cryl was distinct. In 
the baboon in evening and morning, respectively, | the baboon, the median phase for Cryl expression 
whereas in mice, the peak phases occurred in | was during late afternoon, whereas in mice, its ex- 
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pression is delayed by only 7 hours, to peak after 
midnight (Circadian Time 18). In the SCN, unlike 
other tissues, the phase of expression of circadian 
clock components was not systematically opposite 
to that of their baboon orthologs. The peak phase 
of Bmall was similar between baboon and mouse 
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Fig. 4. Tissue-specific rhythmic gene expression. Radial plot of the distribution of the peak phase of expression of the cycling genes in each of the 64 
tissues of the present atlas. On the first plot, gray indicates ZT, and number of gene peaks of expression are listed in black. 
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Fig. 5. The molecular clock across tissues. (A) Distribution of the cycling 
genes ranked according to the number of tissues in which they cycle. 

The most frequent cycling clock component is Ciart, cycling in 52 tissues, 
followed by Bmall and Per. The contributions of the core clock components 
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and output (Dbp) to this distribution are highlighted. (B) Detailed transcript 
abundance time course of the molecular clock components in the 64 tissues 
examined [log2(1 + FPKM)]. (C) Distribution of the peak phases of core 
clock components in the tissues where they are detected as cycling. 
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SCN—that is, peaking in early evening—whereas 
Perl and Cry2 phases were 6 and 12 hours apart, 
respectively (Fig. 7C). This implies a complex 
diurnal-nocturnal switch downstream of circa- 
dian clock components of the SCN. 


Fig. 6. Rhythmic 
transcription drives 
coordinated cellular 
organization and 
rhythmic physiology. 
(A) Heatmap of the 
KEGG pathways 
enriched for cycling 
genes in more than 


A KEGG pathways 


10 tissues. Pathways men 
satisfying the criteria vy 
of overrepresentation NH 
analysis (ORA) (gene ie 
number > 2, z Ag 
score > 2, and per- Aue 
muted P < 0.05) are we 
shown in red (red BEE 
intensity codes for the Sh 
Z score). Insignificantly ee 
enriched pathways are Gey 
represented in black. on 
(B) Phase distribution HEA 
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of representative ne 
KEGG pathways. PAN 
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culated and statisti- CES 
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PSEA tool. (C) Heat- ac 
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genes in more than ne. 
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analysis as described ae 
for (A). (D) Histogram ae 


of the number of tissues 
showing overrepre- 
sentation for Golgi- 
related GO-cellular B 
component terms 
sorted from the cis to 
the trans side of the 
Golgi apparatus. Super- 
imposed is the cumu- 
lative fraction of the 
rhythmic regulation 
for each GO term in 
the 45 tissues (out of 
64) in which at least 
one of these GO terms 
that are related to the 
Golgi apparatus was 
cycling. 
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Because the peak phases of expression of core 
clock components are ~12 hours apart between 
mouse and baboon, we tested whether other 
rhythmic genes in the same tissues shared sim- 
ilar phase differences. Given the differences in 
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sampling (day-night in baboon versus constant 
dark in mice, timed eating in baboon versus ad 
lib eating in rodents, and differences in sam- 
pling frequency), we anticipated a limited num- 
ber of common cyclers between baboon and 
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mouse. Indeed, although the number of genes 
expressed in baboon and mouse tissues showed 
a positive correlation [coefficient of determina- 
tion (R®) = 0.6, P = 0.0145, except for the liver] 
(fig. S6A), the absolute numbers of cycling genes 


were not correlated (R? = 0.14, P = 0.29), with 
relatively few common rhythmic genes (Fig. 7, D 
and E, and fig. S6, B and C). The peak phases of 
expression of rhythmic genes in corresponding 
tissues in baboon and mouse showed distinctive 


features. Specifically, the primate showed a con- 
solidated period of transcription during the day 
and a transcriptionally quiescent period during 
the night (when the animals are asleep), where- 
as there was no such period in rodents (Fig. '7F). 
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Fig. 7. Comparison of rhythmic transcriptome organization in the 
diurnal primate and nocturnal rodent. (A) Normalized daily expression 
profiles of Bmall, Perl, and Cryl across 10 tissues in mouse and baboon 
(ADR, AOR, BST, CER, HEA, KID, LIV, LUN, MUS, and WAT). (B) Phase 
distribution of the main clock genes in the baboon (left) and the mouse 
(right). The colored fraction of the rings corresponds to the confidence 
interval (95%) of the phases of the corresponding clock genes in the 
tissues in which these genes were detected as cycling. (©) Comparison of 
the phase of Bmall, Perl, Cry1, and Cry2 in the SCN (dot) and in the other 
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MUS 


649 2615 


the tissues (95% confidence interval) in mouse (orange) and baboon 
(blue). Baboon Cry1 and Cry2 are not detected as cycling in the SCN; their 
maximum expression is represented (crossed circle). (D) Numbers of 
cycling genes in selected tissues compared in the baboon and the mouse 
and overlap of common cycling genes in the two species. (E) A tissue- 
by-tissue comparison showing the lack of correlation of the number of 
cycling genes in the baboon and in the mouse (R* = 0.086, P = 0.38). 
(F) Normalized peak phases of gene expression in 12 tissues throughout 
the day (left in the baboon, right in the mouse). 
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Furthermore, among the common cyclers be- 
tween mouse and baboon in the different tis- 
sues, we did not observe a consistent 12-hour 
difference in the peak phases of expression. This 
is in contrast to what would be expected from 
the hypothesis of a simple nocturnal-diurnal 
“switch” downstream of the SCN. For example, 
in HEA, 47% of shared rhythmic genes cycled 
with <6-hour phase difference between mouse 
and baboon, whereas in other organs, such as 
CER, 65% of shared genes cycled with a 10- to 
15-hour phase difference (fig. S6D). This suggests 
a very diverse and autonomous temporal orga- 
nization across peripheral tissues. 

Our study provides a comprehensive spatiotem- 
poral gene expression atlas in a diurnal primate, 
allowing the investigation of both tissue-specific 
and diurnal expression patterns underlying phys- 
iology and behavior. Although it has been as- 
sumed that almost every tissue in mammals 
harbors daily rhythms in gene expression, this 
study specifies the identities of rhythmic genes 
in a wide range of tissues covering all major func- 
tions. The abundance and complexity of gene 
expression in a given tissue had little correlation 
with the number or diversity of rhythmic genes 
in the same tissue. The temporal patterns of ex- 
pression add a distinctive dimension to tissue 
and organ functional specializations that cannot 
be inferred from the tissue’s transcriptome sig- 
nature alone. The tissue-specific genes are the 
less likely to be circadianly expressed. Across 
tissues, the majority of rhythmic genes are among 
UEGs. At the organismal level, two-thirds of rhyth- 
mic genes are from the 10,989 UEGs. Similar to 
UEGs, the majority of core circadian clock compo- 
nents were detected in nearly all tissues, but only a 
few out of the 13 component showed rhythmic 
expression in any given tissue. The phase of ex- 
pression of any clock component, when rhyth- 
mic, was similar across tissues. This would imply 
that a core circadian clock in a given tissue is 
composed of a specific combination of different 
paralogous pairs of activators and repressors, 
each oscillating at their respective phases. Such 
tissue-specific organization of the core clock and 
tissue-specific oscillations of UEGs lends sup- 
port to the idea that a primordial circadian sys- 
tem, as found in unicellular cyanobacteria, tuned 
the rhythmic expression of a vast majority of the 
genome to orchestrate daily rhythms in almost 
all essential biochemical and cellular processes. 
With the evolution of metazoans and tissue spe- 
cialization, the basic set of genes necessary for 
cellular function is expressed in all tissues, and the 
circadian clock still continues to oscillate in similar 
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phase, but it recruits a tissue-specific gene reper- 
toire among the UEGs for rhythmic expression. 

The daily rhythms expressed in >80% of the 
PCGs, encoding diverse biochemical and cellular 
functions, constitutes by far the largest regu- 
latory mechanism that integrates diverse bio- 
chemical functions within and across cell types. 
From a translational point of view, rhythmicity 
may have a major impact because 82.2% of genes 
coding for proteins that have been identified as 
druggable targets by the U.S. Food and Drug 
Administration (www.drugbank.ca) (19) show 
cyclic changes in transcription in at least one 
tissue (table S12). These percentages may be 
underestimations because it is likely that in 
heterogeneous tissue such as lymph nodes, genes 
oscillating in only a subset of cell types may not 
be scored as rhythmic. Because daily rhythms 
arise from interactions between the circadian 
clock, light-dark cycle, temperature cycles, and 
daily eating and sleeping patterns (fig. S7), the 
data set—which was collected under defined con- 
ditions known to contribute to daily rhnythms— 
establishes a reference atlas to understand how 
genetic and environmental factors may contrib- 
ute to circadian rhythm disruptions and lead to 
specific pathologies in humans. 
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HUMAN GENOMICS 


Phenotype risk scores identify patients 
with unrecognized Mendelian 


disease patterns 


Lisa Bastarache,’ Jacob J. Hughey,’ Scott Hebbring,” Joy Marlo,’ Wanke Zhao,” 
Wanting T. Ho,” Sara L. Van Driest,*”® Tracy L. McGregor,” Jonathan D. Mosley,* 
Quinn S. Wells,**® Michael Temple,’ Andrea H. Ramirez,* Robert Carroll,! 

Travis Osterman,”* Todd Edwards,* Douglas Ruderfer,* Digna R. Velez Edwards,” 
Rizwan Hamid,” Joy Cogan,” Andrew Glazer,* Wei-Qi Wei,! QiPing Feng,® 
Murray Brilliant,” Zhizhuang J. Zhao, Nancy J. Cox,* 


Dan M. Roden,’*’* Joshua C. Denny’** 


Genetic association studies often examine features independently, potentially missing 
subpopulations with multiple phenotypes that share a single cause. We describe an 
approach that aggregates phenotypes on the basis of patterns described by Mendelian 
diseases. We mapped the clinical features of 1204 Mendelian diseases into phenotypes 
captured from the electronic health record (EHR) and summarized this evidence as 
phenotype risk scores (PheRSs). In an initial validation, PheRS distinguished cases and 
controls of five Mendelian diseases. Applying PheRS to 21,701 genotyped individuals 
uncovered 18 associations between rare variants and phenotypes consistent with 
Mendelian diseases. In 16 patients, the rare genetic variants were associated with severe 
outcomes such as organ transplants. PheRS can augment rare-variant interpretation and 
may identify subsets of patients with distinct genetic causes for common diseases. 


lassically, Mendelian diseases are thought 

to be rare, caused by variants with large ef- 

fect sizes, and associated with considerable 

morbidity and mortality. Many are charac- 

terized by a range of clinical phenotypes, 
often affecting multiple organ systems. Several 
lines of evidence suggest that genes known to 
cause Mendelian disease also harbor variants that 
contribute to complex disease (7). Studies have 
identified clinical overlap in patients codiag- 
nosed with Mendelian and complex diseases (2), 
and single-nucleotide polymorphisms found in 
genome-wide association studies (GWASs) are 
enriched for Mendelian loci (3). A review of evi- 
dence from GWASs and whole-exome sequenc- 
ing studies found an overlap between primary 
immunodeficiency genes and complex inflam- 
matory diseases (4). Collectively, this evidence 
suggests that variants in Mendelian disease- 
causing genes may be an underrecognized con- 
tributor to complex disease. 
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Until very recently, the phenotypic effects of 
rare genetic variants were ascertained primarily 
through family-based studies of patients with 
distinctive and often severe phenotypes. Population- 
level techniques, such as GWASs and phenome- 
wide association studies (PheWASs) (5, 6), are not 
easily applied to rare variation because most 
studies are underpowered. Cohorts large enough 
to support GWASs of rare variants have only re- 
cently been assembled and have demonstrated 
the potential impact of rare variants on complex 
traits such as height, finding rare variants with ef- 
fect sizes much greater than those of common 
variants (7). 

Estimating the pathogenicity of rare variants 
remains a challenge and a barrier to use in the 
clinical setting (8). Many algorithms have been 
developed to predict variant pathogenicity (9-11), 
and consortia such as ClinGen (72) are aggregat- 
ing knowledge to enable expert determinations. 
Resources such as ExAC (13) have helped refine 
variant interpretation. Some variants previously 
interpreted as pathogenic are too common in 
some populations to cause rare, life-threatening 
disorders (74), whereas others thought to be com- 
pletely penetrant do not always cause disease (15). 
Initial studies suggest that electronic health records 
(EHRs) linked to genetic data may help drive 
genomic discovery and define clinical phenotypes 
associated with rare variants (16-18). 

We have developed an approach that increases 
the power to detect rare-variant associations 
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by leveraging the phenotypic patterns of Men- 
delian diseases. By mapping the clinical man- 
ifestations of a Mendelian disease to phenotypes 
extracted from the EHR, we can compute a 
“phenotype risk score” (PheRS) that expresses 
the degree to which an individual’s symptoms 
overlap with a Mendelian disease. We defined 
a PheRS as a weighted aggregation of genetically 
related phenotypes, analogous to the genetic risk 
score approach for analyzing multiple variants 
against a single phenotype. PheRS was validated 
against clinically diagnosed cases and controls, 
and a genetic association study of PheRS pro- 
files for 1204 Mendelian conditions identified 
both known and previously unknown associa- 
tions with variants in target genes. The approach 
presents a method for measuring the pheno- 
typic impact of rare variants and for identifying 
the heretofore underrecognized contribution of 
Mendelian disease genes to common medical 
conditions. 


Constructing a phenotype risk score 


The Online Mendelian Inheritance in Man (OMIM) 
database provides clinical synopses for thousands 
of monogenic diseases (79) that have been anno- 
tated using the Human Phenotype Ontology 
(HPO) (20). We created a map from HPO terms 
to consolidated billing codes from the EHR called 
phecodes. Phecodes enable high-throughput as- 
certainment of EHR phenotypes and have been 
widely used to replicate known genetic associa- 
tions and discover new ones (27-23). By mapping 
HPO terms to phecodes, we can express “pheno- 
type syndromes” patterned after Mendelian dis- 
eases in OMIM in terms of clinical phenotypes 
that can be rapidly derived from the EHR. The 
PheRS for a given Mendelian disease is defined 
as the sum of clinical features observed in a given 
subject weighted by the log inverse prevalence of 
the feature. 


Validating PheRS 


We compared the PheRSs of clinically diagnosed 
cases with matched controls for six Mendelian dis- 
eases. PheRS was a very strong predictor of case 
status for five of the diseases (Wilcoxon rank sum 
test; P = 8 x 10° to 5 x 10°”) (Fig. 1, A and B). 
The exception was phenylketonuria (P = 0.28), 
which effectively served as a negative control be- 
cause newborn screening and dietary avoidance of 
phenylalanine essentially eliminates disease 
manifestations in affected individuals (24). The 
PheRS for each Mendelian disease demonstrated 
specificity for the target disease: The cases for dif- 
ferent Mendelian diseases had similar PheRS dis- 
tributions to those for controls (Fig. 1C). The lone 
exception was that the PheRS for hereditary he- 
mochromatosis (HH) was significantly elevated 
for cystic fibrosis (CF) cases versus controls. How- 
ever, even in this instance, CF cases had PheRSs 
that were three times as high for CF compared 
with HH. A review of controls with a PheRS 
greater than the 75th percentile identified one in- 
dividual (PheRS > 99th percentile) who was diag- 
nosed with HH in the 6 months after the case/ 
control ascertainment. Thus, for this individual, 
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the PheRS suggested the diagnosis before it was 
made by providers. 


PheRS identifies potentially pathogenic 
variants in Mendelian disease genes 


We conducted an association analysis based on 
a cohort of 21,701 adults of European ancestry 
genotyped on the Exome BeadChip (table S1). In 
this cohort, we computed PheRSs for 1204: Men- 
delian diseases (1096 causative genes) for which 
we had sufficient genotype data. We tested for 
association between PheRSs and 6188 rare var- 
iants (minor allele frequency < 1%) using linear 
regression, assuming a dominant genetic model. 
We only tested the PheRS for a particular Men- 
delian disease against variants in the gene or 
genes known to cause that disease. We found 18 
significant associations between rare variants 
and PheRSs (false discovery rate g < 0.1; Table 1). 
All significant results had a positive beta coeffi- 
cient, indicating that the variants were associated 
with an excess of Mendelian disease phenotypes. 
Four of the genes had an established dominant 
mode of inheritance, whereas the remaining 
13 genes were known as exclusively or primarily 
recessive. Four were annotated in ClinVar as 
“pathogenic” or “likely pathogenic,” and the Hu- 
man Gene Mutation Database (HGMD) provided 
evidence of pathogenicity for an additional three 
variants (25). The phenotypic impact of the re- 
maining nine variants have not, to our knowl- 
edge, been previously described. 

Clinical chart review revealed that eight of the 
807 individuals with statistically significant var- 


Cystic Fibrosis 


Marfan Syndrome 


iants were diagnosed with the target Mendelian 
disease. Seven individuals with one of the two 
CFTR variants (p.G542* and p.R553*) were diag- 
nosed with CF. Clinical genetic testing confirmed 
the variants called on the Exome BeadChip, in- 
cluding one homozygote for p.G542*, and estab- 
lished compound heterozygosity with AF508 for 
five others (Fig. 2A and table $2). All individuals 
diagnosed with CF had a PheRS greater than 
four standard deviations from expected values. 
Additionally, the highest-scoring heterozygote 
for p.E168Q in HFE was diagnosed with HH on 
the basis of clinical findings. The diagnosis of 
HH was considered but never confirmed for 
another p.E168Q heterozygote who died of end- 
stage liver disease. 

Although the majority of patients with signif- 
icant variants were undiagnosed, these individuals 
had a high burden of severe end points related to 
the Mendelian diseases. Of the 40 heterozygotes 
for HFE p.E168Q, four had liver transplants (10 
versus 1.2% in background; P = 2.1 x 10°; Fisher’s 
exact test). Individuals with variants in two genes 
associated with renal failure had elevated rates of 
kidney transplant: Five of 36 patients (14%) with 
AGXT p.A295T received transplants (another is 
awaiting transplant), as well as two of 15 patients 
(13%) with DGKE p.W322* (versus 3% in back- 
ground; P = 6.9 x 10” and P = 0.088, respectively; 
Fisher’s exact test). Four of 69 TG p.G77S het- 
erozygotes underwent thyroidectomies (6 versus 
2% of noncarriers; P = 0.039; Fisher’s exact test). 
These are end-stage phenotypes, potentially re- 
sulting from the effects of these variants, and 


Achondroplasia 


did not have an increased prevalence in other sig- 
nificant variant carriers (table S3). Additionally, we 
found that the population attributable fraction for 
the constituent PheRS phenotypes averaged 0.5%, 
with a maximum of 4.5%, suggesting that common 
diseases in adult populations may, in some cases, 
be attributed to variants in Mendelian genes (fig. S1). 

An examination of PheWAS results using 
Fisher’s exact test for variants identified in the 
discovery analysis revealed that constituent phe- 
notypes were often marginally significant (P < 
0.05), while not crossing the Bonferroni correc- 
tion level for a single PheWAS. For CFTR p.G542*, 
three features used in the PheRS for CF achieved 
marginal significance (bronchiectasis, disease of 
the pancreas, and chronic airway obstruction) 
(Fig. 2B). However, the constituent phenotypes 
for CF were only statistically significant when 
they were analyzed collectively as a PheRS. The 
association of p.G542* with the PheRS for CF 
was similar to the association with the pheno- 
type of CF itself (PheRS by linear regression, P = 
3 x 10°, versus CF diagnosis by Fisher’s exact 
test, P = 8 x 10~”). Similarly, although individuals 
with variant p.W322* in DGKE had an excess of 
nephrotic syndrome features (Fig. 2C), these 
phenotypes were not significantly associated 
on their own in the PheWAS analysis (Fig. 2D). 
A similar pattern was observed for the remain- 
ing variants identified in the discovery analysis 
(figs. S2 to S17). A PheWAS analysis of all 6188 
variants tested in the discovery analysis and 1734 
phecodes did not yield any significant (g < 0.1) 
associations. 


A ™ B 
12 12 Disease Abbrev. Cases P 
P Cystic fibrosis CF 733. 5x10720 
P 3 Marfan syndrome MS 2442x1017 
6 
@ Achondroplasia = ACH 101. 4x10°! 
2 
4 Hereditary -43 
4 4 hemochromatosis MA - 8x10 
2 Li-Fraumeni LF 24 4x10 
syndrome 
0 =— P } == Phenylketonuria. PKU 133 0.280 
case control case control case control 
Hemochromatosis Li-Fraumeni Syndrome Phenylketonuria Cc 
10 4 4 PheRS definition 
CF MS ACH HH LF PKU 
8 F i CF 0.00 | 0.00 | 1.43 | 0.00 | 0.00 
5 
6 %= MS) 0.00 0.00 0.00 0.00 0.00 
2 = 
a = 
o 2 2 oO 
= © ACH) 0.00 | 0.00 0.00 | 0.00 
4 2 
5 HH} 0.00 0.00 0.00 0.00 
1 1 (s) 
2 oO 
% LF} 0.00 0.00 0.00 0.00 0.00 
oO 
0. 0: 0- 
PKU| 0.00 0.00 0.00 0.00 0.00 0.00 
case control case control case control 
Fig. 1. PheRSs capture the diagnostic fingerprint of Mendelian disease of cases and statistical significance between cases and controls (Wilcoxon 
in EHR data. Scores for six Mendelian diseases were calculated for clinically rank sum test) for each disease. (C) Matrix of standardized differences in 


diagnosed cases and controls matched by age, sex, race, and record length. 
(A) Boxplots of PheRSs for cases and controls for each disease. (B) Number 
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location (pseudomedian) of the PheRSs between cases and controls (by row) 
and for each Mendelian disease definition (by column). 
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Replication of previously 

unrecognized associations 

We attempted to replicate significant associa- 
tions from the discovery analysis in two inde- 
pendent cohorts: a European ancestry cohort 
from Marshfield Clinic (n = 9441) and a non- 
European ancestry cohort from Vanderbilt (n = 
3820) (tables S4 and S5). Each was tested as in 
the discovery cohort, using linear regression as- 
suming a dominant model, adjusted for age and 
sex. Only variants with at least 10 heterozygotes 
or homozygotes for the rare allele were tested. In 
the Marshfield cohort, both attempted associa- 
tions were replicated: p.G77S in TG with thyroid 
dyshormonogenesis PheRS (P = 5.0 x 10“) and 
p-R507H in FANI with karyomegalic interstitial 
nephritis PheRS (P = 8.2 x 107°) (table S6). In the 
Vanderbilt non-European ancestry cohort, we rep- 
licated two of three associations: p.A993A in 
KIFIA with spastic paraplegia PheRS (P = 1.9 x 
10~*) and p.A295T in AGXT with primary hy- 
peroxaluria type 1 PheRS (P = 3.9 x 10°). The 
association between p.R507H in FAN/ and karyo- 


megalic interstitial nephritis PheRS was not 
replicated in the non-European ancestry cohort, 
potentially owing to the small number of indi- 
viduals with the allele in the replication cohort 
(nm = 15). 


Sequencing individuals with hitherto 
unrecognized variants 


To test for additional rare variants segregating 
with high-PheRS individuals, we analyzed the 
whole-exome sequences of 84 individuals from 
the discovery analysis for seven of the significant 
variants (table $7), including individuals with 
elevated (n = 36) and nonelevated (n = 48) PheRSs. 
A total of four individuals were found to carry a 
second rare, nonsynonymous variant in the target 
gene (Fig. 3 and tables S8 and S9). Two were 
possible compound heterozygotes (phase could 
not be determined in this analysis) (PLCG2 and 
AGXT), and two were homozygotes for the var- 
iant identified in the discovery analysis (DGKE 
and AGXT, confirming the results from geno- 
typing). Three of the four individuals with con- 


firmed second variants had the highest PheRS 
for their respective diseases among those se- 
lected for whole-exome sequencing. 

The heterozygote for AGXT p.A295T who was 
found to have an additional rare AGXT variant 
(p.R381K) through whole-exome sequencing 
had the highest PheRS for primary hyperoxaluria 
type 1, a recessive condition characterized by 
nephrocalcinosis and oxalate nephrolithiasis; an 
EHR review revealed that he had calcium oxalate 
crystals evident in urinalysis. The second-highest- 
scoring individual, a confirmed heterozygote, was 
diagnosed with hyperoxaluria, which was attri- 
buted to his Crohn’s disease. The p.A295T homo- 
zygotes in the discovery and replication cohorts 
were no more symptomatic than their heterozy- 
gous counterparts. This evidence, along with the 
persistence of the signal after removing individ- 
uals with second variants, suggests that p.A295T 
may act as a strong risk factor for hyperoxaluria, 
with more severe manifestations occurring in 
individuals with additional genetic or environ- 
mental risk factors. 


Table 1. Significant associations between phenotype risk scores (PheRSs) for Mendelian disease and rare variants. Shown are significant results 
from the analysis of 7520 PheRS-variant pairs, generated using linear regression assuming a dominant model, adjusted for age and sex. All associations with 
a false discovery rate of g < 0.1 are included. The established mode of inheritance is listed in the “OMIM reported inheritance” column; AD indicates 
autosomal dominant, AR indicates autosomal recessive; and AR* indicates that disease has also been reported in heterozygotes. ClinVar designations are 
included, when available: P, pathogenic; LP, likely pathogenic; LB, likely benign; U, uncertain significance. Variants with relevant phenotype associations 

in HGMD are indicated with a “Y." Results from applying American College of Medical Genetics and Genomics (ACMG) interpretations are given in the 

last column; an arrow indicates that the interpretation changed in light of evidence presented in this paper. rsIDs are assigned by the dbSNP database. 


HOM, homozygote; HET, heterozygote. 


HOM on” Phenotype 
Gene Variant rsID HET Associated Mendelian Disease ies categories in PheRS| Beta P ClinVar HGMD ACMG 
CFTR peater "113993959 1/27 Cystic fibrosis AR BB BH 1.39 29x10 P YP 
CHRNA4 ee rs55855125 1/21 Nocturnal frontal lobe epilepsy, 1 AD | 0.58 9.0x10® u U 
DGKE ork rs138924661 1/14 Nephrotic syndrome, type 7 AR al BH 1.31 2.8x107 LP Y LPP 
SUOX eon rs202085145 0/24 Sulfocysteinuria AR i MM os2 17x10 vu U>P 
CFTR Sinner 1874597325 0/12 Cystic fibrosis AR HZ HW 1.81 24x10® P Y oP 
KIF1B nara 1841274468 0/21 Charcot-Marie-Tooth disease, 2A1 AD ea MM 079 = 5.3x10% U 
VWF sane 4 8144072210 0/21 Von Willebrand disease AR* Zi 0.53 8.6x10% Y U 
RIPYA Bree rs116297894 1/25 Spastic paraplegia-30 AR BW WH os84 1.3x10° ts LB>U 
F10 een rs149212700 0/15 Factor X deficiency AR’ Zi 0.62 1.9x10° U 
HFE  “Sureacin 146519482 0/40 Hemochromatosis a Be ee 1.08 4.0x10° uo YU 
TG cece 18142698837 0/69 Thyroid dyshormonogenesis a EB 0.26 6.0x10° Y USP 
SH2B3 Gas 8148636776 0/22 Familial erythrocytosis, 1 AD BMMMMM 1.48 = «6.1x10° U>P 
SPTBN2 Shraeaoiie 13145522851 0/11 Spinocerebellar ataxia AR* @ 0.75 9.0x10% U>LP 
FAN1 Cameos rs150393409 0/434 Interstitial nephritis, karyomegalic AR a a 0.15 9.9x10° LB-U 
PANK2 bala 8137852959 0/26 HARP syndrome AR Bn Hw 0.58 1.1x10% P YP 
SH2B3 sGleael rs148636776 0/22 Essential thrombocythemia aD if Lia 0.33 1.4x10" U>P 
AGXT een rs13408961 1/35 Primary hyperoxaluria, type | AR BMD WH ose 17x10* us LB-U 
PLCG2 Siesival rs190840748 0/10 Familial cold autoinflammatory syn. 3) AD ZB 0.70 1.9x10% U 
HB Neoplastic HB Nervous/Psychiatric/Sensory HB Digestive/Genitourinary HB Other symptoms/Injuries 


HB Endocrine/Metabolic/Blood 
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Relative Risk 
Acute upper respiratory infections 
Asthma 
Intestinal malabsorption 5 
Bronchiectasis 
Pneumonia 
Diseases of pancreas 
Anal & rectal conditions 
Biliary cirrhosis 
Hepatomegaly 
Pulmonary embolism & infarction 
Male infertility 
Idiopathic fibrosing alveolitis 
Acute bronchitis & bronchiolitis 
Chronic airway obstruction 
Hypovolemia 


Relative Risk 
70 


Chronic renal failure 


Acute renal failure 


10 


Nephritis; nephrosis; renal sclerosis 
Proteinuria 

Hemolytic-uremic syndrome 
Acquired hemolytic anemias 


Thrombocytopenia 


Fig. 2. Phenotypes and PheWAS for two var- 
iants associated with PheRS for cystic fibro- 
sis and nephrotic syndrome. For phenotype 
grids (A) and (C), each row corresponds to a 
phenotype used in the PheRS; each column 
represents an individual who is heterozygous or 
homozygous (starred) for the variant. The bar 
on the left of the grid indicates the relative risk 
(RR) for each phenotype compared with the 
wild type. In grid (A), individuals clinically diag- 
nosed with cystic fibrosis are enclosed in a blue 
box. PheWAS plots (B) and (D) show the 
PheWAS (Fisher's exact P value) for the variant 
in (A) and (C), respectively. The constituent 
phenotypes that define the PheRS are starred. 
All associations with P < 0.001 are labeled. 

The horizontal red and blue lines show the 
Bonferroni correction threshold for an individual 
PheWAS and the nominal (uncorrected) 

P = 0.05, respectively. Single-letter abbreviations 
for the amino acid residues are as follows: 

A, Ala; E, Glu; G, Gly; H, His; |, lle; K, Lys; Q, Gln; 
R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 
An asterisk signifies a termination codon. 


The highest scorer for familial cold autoin- 
flammatory syndrome 3 (FCAS3), a dominant 
condition caused by variants in PLCG2, presented 
in the emergency room with a systemic “urticarial- 
type rash,” for which a cause was never identified, 
and continued to present with blistering rashes 
and lip and tongue swelling. Whole-exome se- 
quencing revealed that this patient harbors a 
second rare variant (p.R687S) in the SH2 domain 
of PLCG2. A nearby variant (p.S707Y), also in the 
SH2 domain, has been implicated in a related 
dominant disease that has overlapping features 
with FCAS3 (OMIM #614878) (26). 

The confirmed homozygote for p.W322* in 
DGKE, a recessive gene that causes nephrotic 
syndrome type 7, was diagnosed with hemolytic 
uremic syndrome as a child and received a kid- 
ney transplant in his teens. A genetic etiology for 
his symptoms was never explored. 

Sequencing confirmed the variants called on 
the Exome BeadChip for SUOX, SH2B3, SPTBN2, 
and 7G and did not reveal any additional rare 
variants in the target genes. For SH2B3, the lack 
of a second variant is consistent with an estab- 
lished dominant inheritance pattern, as well as 
the high proportion of heterozygotes in our cohort 
(20 of 22) with at least one feature of familial 
erythrocytosis. Individuals without a second var- 
jant in AGXT and DGKE, both of which are asso- 
ciated with recessive conditions, also had elevated 
PheRSs (Fig. 3). This stands in contrast to the 
heterozygotes for the CFTR variants, who did 
not have a significantly elevated PheRS (P = 0.51; 
linear regression assuming a dominant model, ad- 
justed for age and sex), consistent with a recessive 
inheritance model. These findings suggest a blurr- 
ing of the distinction between dominant and re- 
cessive labels for some genes. 

Sequencing did not reveal any additional rare 
nonsynonymous variants in the 36 individuals 
with nonelevated PheRSs. In general, individuals 
with the highest PheRSs were more likely to be 
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Fig. 3. Whole-exome sequencing reveals second variants among indi- 
viduals with high PheRSs and demonstrates disease risk in heterozy- 
gotes. Each point represents an individual who is heterozygous or 
homozygous for the variant labeled on the left. The x axis represents the 
z-score for the PheRS relative to what is expected given age and sex (using 
the residual from the PheRS). All individuals carry at least one copy of the 
variant indicated on the left; additional variants identified by whole-exome 
sequencing or clinical chart review are labeled for each individual; 


clinically diagnosed or have additional genetic 
variants related to their symptoms (fig. S18). 


Biologic validation of SH2B3, TG, and 
SUOX associations 


We selected three candidate previously unrecog- 
nized associations for biologic validation: SH2B3, 
SUOX, and TG. SH2B3 is a negative regulator of 
cytokine signaling in hematopoietic cells that 
operates through a direct interaction between 
its SH2 domain and JAK2 to attenuate JAK2- 
mediated activation of proliferative pathways 
(27). The variant identified in this study, p.E395K, 
is located in a region of the protein that is critical 
for its inhibitory function (28) and is near known 
disruptive variants (29). Human embryonic kidney 
(HEK) 293T cells stimulated with erythropoietin 
showed an increase in phosphorylated extra- 
cellular signal-regulated kinase (pDERK) levels that 
was quenched in the presence of wild-type SH2B3 
but not quenched with both the known p.R392E 
functional mutation and our p.E395K variant (Fig. 
4, A and B). 

Splicing prediction programs suggested a prob- 
able reduction in 5’ donor strength for SUOX 
p.R76S and possible generation of an exonic cryptic 
splice acceptor site by TG p.G77S. SUOX p.R76S is 
located at the conserved -1 position of the 5’ 
donor of exon 5. We demonstrated that the SUOX 
variant caused a decrease in exon inclusion from 
96 to 35% (unpaired two-tailed ¢ test; P < 0.001; 
Fig. 4C). No transcripts aside from the exon- 
included and exon-skipped transcripts were de- 
tected. Similarly, TG p.G77S resulted in altered 
splicing. The basal rate of exon inclusion was re- 
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duced from 65% for the wild-type TG exon to 
only 26% inclusion in the p.G77S exon (unpaired 
two-tailed t test; P < 0.001). These ratios were 
consistent across a range of cDNA concentra- 
tions and polymerase chain reaction (PCR) cycle 
numbers (Fig. 4, C and D). 


Comparison of PheRS with existing 
methods to determine 
variant pathogenicity 


Across all PheRS variant associations with nom- 
inal P < 0.05 (n = 454), functional annotations 
were significantly correlated with PheRS effect 
size (Wilcoxon rank sum test); splice donor/ 
acceptor and stop-gain variants tended to have 
the largest effect size, followed in decreasing or- 
der by missense, splice region, synonymous, and 
intron/untranslated region variants (fig. SI9A). 
Thirteen of 14 functional prediction methods 
trended or associated with the probability of 
finding associations with the PheRS; predictions 
from CADD (combined annotation-dependent 
depletion) (9), SiPhy (30), and Polyphen2 HVAR 
(10) were statistically significant (P < 0.05; 
Fisher’s exact test; fig. S19B). 


Discussion 


In our validation study, PheRS was very effective 
in identifying patients with diagnosed Mendelian 
disease by using only the phenotypic signatures. 
Applying PheRS to a genotyped population, we 
found an increased burden of phenotypes among 
individuals with rare variants in Mendelian dis- 
ease genes. Sequencing identified or confirmed 
second rare variants in four individuals, three of 
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homozygotes confirmed by sequencing are labeled “HOM.” Additional 
CFTR variants were ascertained from clinical testing in the EHR; all other 
individuals were sequenced for this study. Clinically diagnosed individuals 
are represented as squares; all others are shown as circles. Where additional 
variants were found, the association test from the discovery analysis was 
repeated after dropping individuals with a second variant (P values 
generated using linear regression assuming a dominant model, adjusted for 
age and sex), and the P value is recorded under the gene/variant label. 


whom had the highest PheRS among all hetero- 
zygotes or homozygotes for that variant. In vitro 
studies provided supporting evidence of patho- 
genicity for all three variants tested. 

Although our approach relies on many dec- 
ades of accumulated knowledge about the pheno- 
typic imprint of Mendelian disease, the method 
itself is simple to implement. Our ability to rep- 
licate results in an external cohort suggests that 
it is portable and would therefore be applicable 
to data sets such as those of the Million Veteran 
Program, UK Biobank, and the All of Us Re- 
search Program (All of Us is a service mark of the 
U.S. Department of Health and Human Services). 
Applied to such large populations, this method 
could facilitate the discovery of pathogenic var- 
iants, refine estimates of penetrance across 
diverse populations, and provide a more nuanced 
understanding of inheritance patterns, which 
this study suggests may be more complex than 
merely “recessive” or “dominant” for some genes. 
Incorporation of richer EHR data, such as lab- 
oratory results and clinical notes (37), could in- 
crease the resolving power of PheRS. Furthermore, 
this method may be used with other combina- 
tions of phenotypes that do not follow estab- 
lished Mendelian patterns, perhaps based on 
undiagnosed patients with unusual presentations. 

The American College of Medical Genetics and 
Genomics established guidelines for variant inter- 
pretation that reflect the need to combine multiple 
lines of evidence, including population-based 
genotype-phenotype correlations (32). Our method 
provides a high-throughput means to generate 


such evidence. Using these guidelines, 10 of the 
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Fig. 4. PheRS enriches for variants with altered function in vitro. Representative Western blots 
(A) and mean pERK2 levels normalized to erythropoietin receptor (EPOR) expression (B) in EPO- 
stimulated HEK293T cells transiently transfected with wild-type (WT) or variant SH2B3 constructs, 
EPOR, and JAK2. As expected, known functional mutation SH2B3-R392E fails to inhibit EPO- 
stimulated ERK phosphorylation. Similarly, SH2B3-E395K shows ~1.8-fold elevation (relative to 

WT SH2B3) of EPO-stimulated ERK activation at 10 min. Reverse transcription PCR analysis (C) and 
quantification (D) of WT versus variant splicing of SUOX and TG exons in HEK293T cells transiently 
transfected with empty minigene vector pETO1, pETO1 containing exons of interest flanked by 

100 base pairs (bp) of intronic sequence, or negative control pIRES2-EGFP (enhanced green 
fluorescent protein). Absolute change in the percent of exon inclusion was —61% for the SUOX variant 
and -39% for the TG variant. Means + SEM; n = 4 [(A) and (B)] or 5 [(C) and (D)] independent 
experiments; unpaired two-tailed t test; *P = 0.003, **P < 0.001. 


variants from the discovery analysis were inter- 
preted as having “uncertain significance.” By add- 
ing data from the PheRS analysis, in combination 
with evidence from our in vitro studies, four of 
these variants could be converted to “likely path- 
ogenic” or “pathogenic” (Table 1 and table S10). 
Our findings suggest that the phenotypic bur- 
den of rare variants in Mendelian genes may be 
greater than previously thought. A combination 
of PheRS and sequencing identified symptomatic 
individuals with genetics consistent with estab- 
lished inheritance patterns—heterozygous indi- 
viduals for dominant genes (SH2B3 and PLCG2) 
and individuals with confirmed second variants 
in recessive genes (DGKE and AGXT)—none of 
whom were diagnosed with a genetic condition. 
A much larger number of individuals were het- 
erozygous for variants in genes with a presumed 
recessive inheritance and yet still had symptoms 
consistent with the Mendelian disease pattern. 
Although we cannot exclude the possible influ- 
ence of structural or noncoding variants, the evi- 
dence suggests that these variants increase risk 
in heterozygotes. These individuals tend to have 
disease that is mild compared with the classic 
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presentations but severe relative to the general 
population. For example, homozygous pathogenic 
mutations in 7G are associated with congenital 
goiter, which often progresses to thyroid carcinoma; 
our most severely affected heterozygote received 
a thyroidectomy at age 26 for goiter and thyroid 
carcinoma. 

This work adds to the evidence that Mende- 
lian and complex disease are not dichotomous, 
but rather exist on a spectrum. As a method that 
is both high-throughput and sensitive to the vast 
knowledge already acquired, PheRS is a tool that 
may help bridge the gap between Mendelian and 
complex disease. A consequential question is 
whether the treatments designed for a Mende- 
lian condition could be effective in individuals 
with nontraditional molecular presentations. Of 
the 17 diseases represented among those patients 
with suspected but undiagnosed Mendelian dis- 
ease, 11 have specific treatments available (table 
$11), some of which could alter the long-term 
course of the disease. 

The impact that this approach will have on 
accelerating precision medicine depends on three 
interrelated challenges. First, we must integrate 
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statistical associations generated with PheRS into 
guidelines used for variant interpretation. Sec- 
ond, as we collect stronger evidence for the phe- 
notypic effects of rare variants, we must learn to 
rapidly and effectively integrate that knowledge 
into clinical care. Third, we must determine 
whether PheRS can be used to prospectively iden- 
tify patients whose symptoms are caused by var- 
iants in Mendelian genes. If these challenges are 
addressed, approaches like ours may ultimately 
enable the conversion of big data not just to 
knowledge but also to improved care and out- 
comes for patients. 
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POLYMERS 


Random heteropolymers 
preserve protein function 
in foreign environments 


Brian Panganiban,’ Baofu Qiao,” Tao Jiang,’ Christopher DelRe,’ Mona M. Obadia,” 
Trung Dac Nguyen,” Anton A. A. Smith,” Aaron Hall,’ Izaac Sit,’ Marquise G. Crosby,* 
Patrick B. Dennis,* Eric Drockenmuller,? Monica Olvera de la Cruz,” Ting Xu?©75* 


The successful incorporation of active proteins into synthetic polymers could lead to a new 
class of materials with functions found only in living systems. However, proteins rarely 
function under the conditions suitable for polymer processing. On the basis of an analysis 
of trends in protein sequences and characteristic chemical patterns on protein surfaces, 
we designed four-monomer random heteropolymers to mimic intrinsically disordered 
proteins for protein solubilization and stabilization in non-native environments. The 
heteropolymers, with optimized composition and statistical monomer distribution, enable 
cell-free synthesis of membrane proteins with proper protein folding for transport and 
enzyme-containing plastics for toxin bioremediation. Controlling the statistical monomer 
distribution in a heteropolymer, rather than the specific monomer sequence, affords a new 
strategy to interface with biological systems for protein-based biomaterials. 


ature’s building blocks, such as proteins 

and biomachinery, have many features 

unmatched by synthetic counterparts, in- 

cluding chemical diversity, hierarchical 

structure, programmed system dynamics, 
and efficient energy conversion. Despite years of 
effort to stabilize proteins outside of their native 
environments, there has been limited develop- 
ment in interfacing biological and synthetic com- 
ponents without compromising their structures 
and inherent functions. Nonaqueous enzymology 
via reverse micelles can only maintain a fraction 
(<20%) of native activity (1); amphipols solubilize 
membrane proteins, but not for water-soluble pro- 
teins in organic solvents (2); polymer conjugation 
relies on accessibility of protein-functional groups 
(3); and sol-gel confinement limits protein acces- 
sibility and postintegration (4). 

A chaperone-like polymeric shell outside of a 
protein may effectively improve protein solubil- 
ity and stability in organic solvents by providing 
a barrier to resist both organic solvent exposure 
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and protein conformational change. To form such 
a nanoscopic polymeric shell, protein-polymer 
interactions need to be strong enough to favor 
adsorption yet soft enough not to outcompete 
the forces governing protein folding. The struc- 
ture and activity of natural building blocks are 
governed by multiple noncovalent interactions 
that are subject to change with small perturba- 
tions (5). When a homopolymer is conjugated to 
a protein surface, monomer-amino acid inter- 
actions can affect protein folding (6, 7) and de- 
form the polymer chain conformation (8). In 
nature, intrinsically disordered proteins adopt 
local chain conformations, often achieved by 
multiple weak binding sites, and mediate various 
processes (9, 10). Amphiphilic heteropolymers mi- 
micking disordered proteins may offer a versatile 
approach for protein solubilization and stabiliza- 
tion given, the diversity and complexity of proteins. 

Protein surfaces are chemically diverse and 
heterogeneous (Fig. 1A). Surface analysis of fold- 
ed water-soluble proteins based on hydropho- 
bicity or charge shows characteristic patch size 
distributions. The typical patch size is 1 to 2 nm 
in diameter, with an interpatch distance of 1 to 
2 nm (Fig. 1B and fig. $1). Sequence analysis of 
water-soluble proteins, performed by assign- 
ing amino acids as either hydrophilic or hydro- 
phobic, using glycine as the reference (11), shows 
that the block length of amino acids with similar 
hydrophobicity tends to be less than 10 (fig. S2). 
Matching the statistical chemical pattern has 
been shown to be critical in de novo protein de- 
sign (2, 13) and in modulating polymer/surface 
interactions (74). Rather than synthesizing sequence- 
specific polymers (75), synthetic heteropolymers 
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with similar chemical features and spatial dis- 
tributions of side chains to match the surface 
pattern of natural proteins may behave like dis- 
ordered proteins. 

Recent developments in reversible deactivation 
radical polymerization make it feasible to synthe- 
size random heteropolymers with reliable control 
over the statistical monomer distribution (76-18). 
Four methacrylate-based monomers are selected 
to impart chemical diversity in heteropolymers and 
to optimize short-range polymer-protein interactions 
(Fig. 1C and fig. S3): methyl methacrylate (MMA), 
oligo(ethylene glycol) methacrylate [OEGMA; 
number-average molecular weight (/,,) = 500 Da], 
2-ethylhexyl methacrylate (2-EHMA), and 3- 
sulfopropyl methacrylate potassium salt (3-SPMA). 
MMA is chosen to tailor the overall hydrophobic- 
ity for protein solubilization, to anchor the chain 
at the polar-nonpolar interface, and to reduce the 
entropic penalty associated with adjusting its 
local conformation. OEGMA is chosen to leverage 
the well-known ability of poly(ethylene glycol) 
(PEG) to stabilize proteins. 2-EHMA and 3-SPMA 
are chosen to interact with the hydrophobic and 
positively charged residues on the protein surface, 
respectively. The selection of the monomer ratio 
is guided by the calculated solubility parame- 
ters (19, 20) and through experimental screenings 
to achieve polymer dispersion in both aqueous 
and organic media with the best retention in 
enzyme activity. Our best-performing hetero- 
polymer, called “RHP,” has a compositional ratio of 
5(MMA):2.5(OEGMA):2(2-EHMA):0.5(3-SPMA). 
Using the well-established Mayo-Lewis equation, 
we estimated the statistical monomer distribu- 
tion along the RHP chains (21, 22). The histo- 
gram of monomer block size confirms the absence 
of long blocks of the same monomer that could 
interfere with native protein structure (figs. S5 to 
87). The RHP used has a M, of ~30 kDa and a 
dispersity of 1.3. At least 12 batches of RHP were 
synthesized with excellent reproducibility (fig. S11). 

Computational studies were performed on mix- 
tures of a common enzyme, horseradish peroxide 
(HRP), and RHP in both an aqueous buffer solu- 
tion and toluene in order to elucidate the polymer- 
protein adsorption mechanism. Explicit solvent 
all-atom molecular dynamics (MD) simulations 
were performed by using the CHARMM 36 po- 
tential (supplementary materials, section S3). The 
RHPs are built with compositions and degrees of 
polymerization close to the corresponding exper- 
imental values. The last simulation snapshots of 
the RHP/HRP mixture in toluene and in water 
are shown in Fig. 2, A and B, respectively. In 
water, RHP and HRP are loosely complexed, and 
only ~40% of HRP surface is covered by RHP, 
which is in line with the experimental observa- 
tions that RHP is soluble in water. However, the 
HRP surface is fully covered by RHPs in toluene; 
the complex is stable, and no protein structure 
change was observed over the simulation dura- 
tion of 0.6 us. The radial distribution of RHPs 
around the protein’s center of mass was calcu- 
lated in toluene (Fig. 2C) and in water (fig. S17E). 
In toluene, the hydrophilic monomers of the 
polymer (OEGMA and 3-SPMA) were found to 
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point inward, adjacent to the protein, whereas 
the hydrophobic monomers (MMA and 2-EHMA) 
were located on the outside, in contact with the 
toluene. By contrast, the favored orientations of 
the RHP side chains become weaker in water. 
These results suggest that once positioned close 
to the protein surface, the RHPs can adjust their 
conformations to maximize protein-polymer in- 
teractions, which correlates well with our hy- 
pothesis. Correlations between the protein surface 
groups and their nearest monomer neighbors 
are computed by categorizing Ala, Val, Ile, Leu, 
Met, Phe, Tyr, Trp, Gly, and Pro amino acids as 
hydrophobic and the others as hydrophilic. Ap- 
proximately 70% of the protein surface is covered 
by the hydrophilic monomers; 50 + 4% originates 
from hydrophilic monomer-hydrophilic amino 
acid interactions, contributing -800 + 300 kJ/mol 
to the energy (Fig. 2D). This shows the importance 
of the local protein surface-polymer interactions in 
stabilizing the protein structure. The shell formed 
by the polymer around the HRP core in organic 
solvents improves protein solubility and stability 
by providing a barrier to resist both organic sol- 
vent exposure and protein conformational change. 

The role of the RHPs’ composition on their 
ability to encapsulate the protein in water and 
toluene was investigated by using a coarse-grained 
(CG) model based on the data gathered from 
all-atom MD simulations and experiments (Fig. 
2E and supplementary materials, section S4). A 
representative snapshot of the RHP encapsulat- 
ing the protein is given in Fig. 2F. The surface 
coverage increases as the attraction strength 
between the adsorbing monomers and protein- 
attractive sites, €;,, increases (Fig. 2G) for all 
solvent selectivity conditions (captured by the 
attraction strength between the polymer solvo- 
phobic beads, €pn). For sufficiently high ey, there 
exists an optimal value of the fraction of ad- 
sorbed monomers (oq) that maximizes the sur- 
face coverage (Fig. 2H). The spatial correlations 
between the adsorbing monomers and the protein- 
attractive sites (fig. S22A) demonstrate that the 
RHPs tend to adopt energetically favored confor- 
mations once in contact with the protein to com- 
pensate for the entropic penalty associated with 
confinement to the surface (23). 

We experimentally tested our hypothesis that 
RHP is capable of interacting with proteins and 
mediating their interactions with the local en- 
vironment. We performed cell-free synthesis of 
membrane proteins that eliminates potential 
interference or assistance from the host cell 
physiology for protein folding and lipid insertion 
during translation (24). Once the plasmids for a 
model transmembrane protein—oligopeptide/ 
proton symporter PepTso or water channel aqua- 
porin Z (AqpZ)—are added, the translation and 
folding status of the membrane protein is scored 
according to the fluorescence intensity of a C 
terminus-fused green fluorescent protein (GFP). 
Western blotting analysis (Fig. 3A) using anti- 
bodies to GFP confirmed PepTso-GFP expres- 
sion regardless of the presence of RHP. Without 
RHP, no GFP fluorescence is detected, suggesting 
that protein folding is not correct to gain GFP 


Panganiban et al., Science 359, 1239-1243 (2018) 


fluorescence (25). Control experiments using 
amphipol (A8-35), which is effective in stabilizing 
membrane proteins after protein expression, show 
little GFP fluorescence for AqpZ-GFP (2). However, 
there is a ~15-fold increase in GFP fluorescence 
when PepTso-GFP or AqpZ-GFP is expressed in the 
presence of RHP at a RHP:ribosome mole ratio of 
50:1 (Fig. 3B). The resultant GFP fluorescence is 
more than an order of magnitude higher than 
that of commonly used liposome. To further 
verify the protein folding, the cell-free synthe- 
sized PepTso was reconstituted in liposomes and 
tested in a pyranine-based proton transport assay 
(Fig. 3C) (26). Proton transport was detected by 
adding valinomycin to the outer solution con- 
taining a dipeptide, Ala-Ala (Fig. 3D). The cell- 
free synthesized PepTso, in the presence of RHP, 


can fold properly to retain transport function in 
a lipid environment. Thus, RHP is able to chap- 
erone proper protein folding and solubilize mem- 
brane proteins in aqueous solution but will not 
outcompete the protein-lipid interaction, inter- 
fere with lipid insertion, or compromise transport 
function. 

We investigated protein dispersion and sta- 
bilization in organic solvents, a requirement to 
interface proteins with synthetic building blocks 
toward protein-based materials. Previous non- 
aqueous enzymology studies of HRP observed 
only less than 1% of its native activity in hydro- 
phobic solvents that can extract the heme co- 
factor (27). For all studies, proteins and RHPs 
were first cosolubilized in deionized water, lyophi- 
lized, and subsequently resuspended in solvents. 
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Fig. 1. Design of random heteropolymers based on protein surface pattern for protein solubili- 
zation and stabilization in organic solvents. (A) As seen in the color-coded proteins—HRP(1H55), 
GOx(1CF3), GFP(2HRW), and a-CT(1YPH)—the protein surfaces are chemically heterogeneous, with 
characteristic patch size and interpatch distance. Neutral hydrophilic, blue; hydrophobic, red; positively 
charged, green; negatively charged, purple. (B) The histograms of the patch diameter and the 
interpatch distance show that the length centers around 1 to 2 nm. Analysis of hydrophobic patches 
is shown. Analysis of negatively and positively charged patches is shown in fig. Sl. (©) Designed 
random heteropolymer with statistical distribution of monomers with varied hydrophobicity matching 
that of the protein. (D) The RHP can coassemble with protein (schematically shown as a patchy 
particle) in organic media in which the random heteropolymer adjusts its local conformation to 
maximize protein-heteropolymer interaction without denaturing the protein's local structure. 
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The RHP/HRP complexes were readily soluble 
in common solvents for material synthesis and 
processing such as toluene and chloroform. 
Transmission electron microscopy (TEM) results 
(Fig. 4A) show that RHP/HRP complexes form 


nanoparticles with a diameter of ~50 to 60 nm. 
Fourier transform-infrared spectroscopy (FTIR) 
spectra were collected after dissolving RHP/ 
HRP complexes in toluene over 24 hours. The 
amide I band and its corresponding negative 
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Fig. 3. Random heteropolymers can 
effectively mediate interaction 
between membrane protein 

and aqueous solution and assist 
protein folding during translation in 
cell-free synthesis. (A) Western blot 
analysis of protein expression with and 
without RHP in a cell-free synthesis 
assay. The left lane shows color-coded 
protein mass standards. (B) The level 
of proper protein folding of GFP-fused 
AqpZ is significantly enhanced with the 
presence of RHP (n = 3 independent 
measurements) in comparison with 
other surfactants, such as amphipol 
A8-35 and dioleoylphosphatidylcholine 
(DOPC) unilamellar liposome. (©) Schematics 
of assay to evaluate PepTso transport 
properties upon insertion into liposomes. 
(D) Cell-free synthesized PepTso in 

the presence of RHP can fold and inset 
into lipids properly, retaining transport 
function across the membrane. 
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Fig. 2. All-atom and CG molecular dynamics simulation results. 

(A and B) Three-dimensional spatial distribution of polymer backbone 
(cyan) around HRP in (A) toluene and (B) water. The isosurfaces are 
plotted for densities 50 times the average. The hydrophilic and hydropho- 
bic protein surfaces are colored in blue and red, respectively. (C) Radial 
density profiles of the polymer backbone and the tail atoms of the four 
monomers (fig. S12) around the center of mass of HRP backbone in 
toluene. (D) Correlation probability (in percent) and energy (kilojoule per 
mole) between the hydrophilic (P, blue) and hydrophobic (H, red) protein 
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second derivative revealed minimal change in the 
HRP’s secondary structure in toluene (Fig. 4B). 
There is little change (<2 nm) in the peak posi- 
tion in the ultraviolet (UV)-visible spectra of the 
heme cofactor in HRP over 24 hours (Fig. 4C), 


100 
Epp/kel = 0. 
G me eni/kgl = 05 
S gq L — awhkal =08 
= —~ eqpkgl = 1.2 
© 
: 60 
S$ | 
3 
© 40 
ra) 
S 
© 20 
ao 
0 
05 10 15 20 25 
Euh/kel 
100 I= Sphel=15 °° ~S 
-o gyp/kpl = 1.2 
go b > &H pl = 1.0 
= un/kel = 0.5 


Surface coverage (%) 
oO 
oO i=} 
—— 
| : 
AX 


810Z ‘G} yore Uo /G10' Bewsouel0s‘e0ue!0S//:d}]y WOd| pepeojuMOGg 


03°04 05 06 0.7 
na 


surface and the hydrophilic (p, light blue) and hydrophobic (h, orange) 
polymer nearest neighbors in toluene. The uncertainties refer to the 
standard deviations. (E) CG models of HRP and a RHP chain, where the red 
beads represent the protein-attractive sites and the orange beads 
represent the adsorbing monomers. (F) A snapshot of the RHP chains 
covering the HRP. (G) The surface coverage as a function of the adsorption 
strength. (H) The surface coverage as a function of the MMA composition. 
An example of the system corresponding to the circled data point is 
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Fig. 4. Random heteropolymers preserve enzymes in organic solvents and enable processing 


of protein-containing functional materials. (A) 


TEM image of RHP/HRP dried from toluene 


solution. The RHP/HRP mixture forms ~50- to 60-nm-diameter nanoparticles (red circles). 

(B) FTIR spectra of the RHP/HRP complex in toluene over 24 hours, suggesting minimal changes in 
the protein secondary structure. (C) The peak position of the UV-visible spectrum of the heme 
cofactor in RHP/HRP complex in toluene over 24 hours, confirming the integrity of its protein tertiary 
structure and enzymatic pocket. (D) In the presence of RHP, ~80% of native HRP activity is 
retained after being stored in toluene over 24 hours (n = 3 independent measurements). Results 


from HRP encapsulated in reverse micelles based 


on sodium bis(2-ethylhexyl) sulfosuccinate (AOT) 


(<25%) or polystyrene-block-poly(ethylene oxide) (PS-b-PEO) polymeric surfactant (<18%) are 
shown for comparison. (E) Enzymatic activity of OPH as a function of storage media for OPH and 
substrate. Activity seven times higher than control experiments in buffer can be obtained when 
OPH and the pesticide methy! parathion are cosolubilized in toluene. (F) Electrospun PEO- or 
PMMA-based fiber mats containing RHP/OPH complex can perform on-demand bioremediation. 


The hydrolysis by-products are trapped in the RH 


confirming the retention of HRP tertiary struc- 
ture and stability of the heme-binding pocket 
in toluene, which is critical for enzymatic activ- 
ity. To evaluate the retention of the HRP activ- 
ity after suspension in toluene, aliquots of RHP/ 
HRP toluene solution were dispersed in aqueous 
solution in order to perform a colorimetric 
assay. With the presence of RHP, ~80% of HRP 
native activity was maintained after 24 hours 
suspended in toluene (Fig. 4D). This is several 
orders of magnitude higher than HRP alone 
in nonaqueous media and at least four times 
higher than the best reported value when using 
molecular and/or polymeric surfactants (J). 
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P/OPH/PEO fiber mat for easy removal. 


We explored the versatility and universality 
of RHP by extending the study to other proteins. 
PEGylation is effective for protein dispersion 
and stabilization but is not effective for proteins 
with a small number of functionalizable groups, 
such as glucose oxidase (GOx) (28). (fig. S24). The 
PEGylation is sufficient for GOx dispersion, but 
the proteins retain less than 10% of native activ- 
ity after 2 hours (fig. S24). By contrast, RHP/GOx 
shows ~50% of native activity after 24 hours of 
storage in toluene. Proteins with different struc- 
tures, such as B-barrel GFP, are also tested. After 
RHP/GFP complex is solubilized in organic sol- 


vents, there is no detectable shift in fluorescent 
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emission peak maxima and minimal decrease in 
the emitted fluorescence over 24 hours (fig. S25). 
These results confirm that the environment in the 
B-barrel interior remains the same and suggest 
that there is minimal toluene penetration into the 
protein core. 

The RHP-enabled protein solubilization and 
stabilization can lead to technologically relevant 
protein-based materials. Organophosphorus hy- 
drolase (OPH) was chosen for its excellent ability 
to degrade organophosphates, which are com- 
monly used as insecticides and chemical warfare 
agents (29). However, OPH becomes inactive even 
in its partially folded dimeric state (30), and 
organophosphates typically have poor solubility 
in aqueous solution. There is a need to retain 
OPH activity while cosolubilized with organo- 
phosphates in order to realize on-demand bio- 
remediation for these acute toxins. We evaluated 
OPH activity using a 10 mM preparation of the 
well-known pesticide methyl parathion (MP). 
RHP/OPH can be readily solubilized in toluene 
and chloroform and resuspended in buffer solu- 
tion. RHP/OPH retains 80 + 5.6% (n = 3 inde- 
pendent measurements) of the initial OPH activity 
after 24 hours’ suspension in toluene. When 
methyl parathion was codissolved with RHP/OPH 
in toluene, dried, and assayed, the OPH activity 
was more than seven times higher than that of 
pure OPH in aqueous solution (Fig. 4E). This is 
attributed to the higher substrate concentra- 
tion in toluene and RHP’s ability to stabilize 
OPH in both aqueous and organic media. The 
successful dispersion and stabilization of RHP/ 
OPH complexes enables coprocessing of OPH 
and synthetic polymers (Fig. 4F). Fiber mats based 
on either polyethylene oxide (PEO) or PMMA 
were prepared by means of electrospinning and 
tested for bioremediation. Both fiber mats were 
active and degraded MP, weighing approximate- 
ly 1/10 of the total fiber mat, in a few minutes. In 
particular, when the RHP/OPH/PEO fiber mat 
was soaked in MP-containing toluene solution 
and assayed in buffer, the hydrolysis by-products 
could be trapped for easy removal. This enables 
in situ toxin remediation without preprocessing, 
transfer, or contact with the agent before and 
after detoxification. 

Our studies confirm that random heteropoly- 
mers designed on the basis of statistical monomer 
distribution are effective at maintaining protein 
function in foreign environments. Using only one 
design of random heteropolymer, we demonstrated 
successful cell-free synthesis of membrane pro- 
teins with proper folding and retention of pro- 
tein activity in organic solvents for a wide range 
of proteins. These hybrid materials will not only 
take advantage of the precision and efficiency of 
natural building blocks but also enable reactions 
and processes on demand where common chem- 
istry necessities are unavailable. 
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Real-time imaging of adatom-promoted 
graphene growth on nickel 


Laerte L. Patera,”?* Federico Bianchini,”* Cristina Africh,”+ Carlo Dri,”?+ 
German Soldano,* Marcelo M. Mariscal,* Maria Peressi,°>+ Giovanni Comelli?” 


Single adatoms are expected to participate in many processes occurring at solid surfaces, 
such as the growth of graphene on metals. We demonstrate, both experimentally and 
theoretically, the catalytic role played by single metal adatoms during the technologically 
relevant process of graphene growth on nickel (Ni). The catalytic action of individual 

Ni atoms at the edges of a growing graphene flake was directly captured by scanning 
tunneling microscopy imaging at the millisecond time scale, while force field molecular 
dynamics and density functional theory calculations rationalize the experimental 
observations. Our results unveil the mechanism governing the activity of a single-atom 


catalyst at work. 


educed dimensionality in the catalyst struc- 
ture often leads to increased chemical re- 
activity (7, 2). More specifically, atomic 
steps of metal surfaces were experimen- 
tally identified as the active sites for sev- 
eral heterogeneous catalytic reactions (3, 4), and 
this identification was theoretically rationalized 
in terms of a characteristic upshift of the d-band 
center (5). Such undercoordinated sites play also 
a crucial role in governing the stability of the 
active phase (4). Single atoms, representing the 
lowest possible coordination, can exhibit even 
higher specificity and efficiency compared to 
stepped surfaces (7). For example, isolated single 


metal atoms anchored onto oxide nanocrystal- 
lites promote CO oxidation (6) and the water- 
gas-shift reaction (7). Recent studies also revealed 
the peculiar activity of single-atom alloys in boost- 
ing the selective hydrogenation of alkenes and 
hydrocarbons (8). 

Mobile adatoms, commonly present on metal 
surfaces at elevated temperatures, are involved 
in several chemical processes occurring on sur- 
faces and at the solid-liquid interface (9, 10). 
However, the lack of direct experimental ob- 
servation of the dynamics of these peculiar pro- 
cesses at the atomic scale, generally owing to 
the difficulty of achieving simultaneous spa- 


tial and temporal resolution, limits the descrip- 
tion and understanding of the surface reaction 
pathways. For the specific case of the chemical 
vapor deposition (CVD) growth of graphene on 
transition metals, a catalytic role of single metal 
atoms was predicted by theorists, but the de- 
tailed growth mechanism at the atomic scale 
remains unclear (1). 

Nickel (Ni) is one of the most used substrates 
to synthesize graphitic carbon single layers (12). 
We report the characterization of graphene growth 
on Ni(111) in real time by means of in situ high- 
speed scanning tunneling microscopy (STM) mea- 
surements. By monitoring the layer formation 
at the atomic scale and with a time resolution 
down to milliseconds, we observed at the kink 
sites of the graphene edges single Ni atoms 
that are directly involved in the growth process. 
Simulations of possible reaction paths based 
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Fig. 1. Graphene growth along z and k edges. (A) Zigzag (z) and Klein (k) edges of a top-fcc EG layer on Ni (111). At both edges, the kink structures are 
highlighted by circles. (B) High-speed STM sequence acquired at 710 K in quasi-constant height mode at the z edge, from movie S1 (V = 20 mV; / = 8 nA; 
frame rate = 36.5 Hz]. White arrows indicate the position of C atoms in fcec-hollow sites near the kink. (©) Same as in (B) but for the k edge, from movie 


S2 (V = 20 mV; | = 7 nA; frame rate = 60 Hz). 
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on density functional theory (DFT) revealed that 
Ni adatoms drive the carbon (C) atom addition 
mechanism by substantially lowering the cor- 
responding reaction barriers. 

Islands of monolayer epitaxial graphene (EG) 
were synthesized on Ni(111) by subsurface C seg- 
regation at temperatures between 700 and 740 K 
(13). Three coexisting stable chemisorbed con- 
figurations of EG were observed (14), with the 
top/face-centered cubic geometry (top-fcc) as 
the most abundant phase. During growth, the 
island edges evolved too rapidly to be resolved 
by standard STM imaging (typical image acqui- 
sition rates of ~0.02 frames/s). Nevertheless, in a 
previous study, we succeeded in identifying the 
top-fec edge structure by performing high-speed 
STM measurements (15), exploiting the FAST 
module developed in our laboratory (16, 17). The 
module allows for image acquisition rates up 
to 100 frames/s with commercial microscopes 
already optimized for structural and reactivity 
studies. The edge terminations of EG islands on 
Ni(i11) during growth at temperatures higher 
than 570 K (15) were identified as the zigzag (z) 
and Klein (k) structures shown in Fig. 1A. For 
both edges, kink sites obtained by interrupting 
the last two C rows were predicted to play an im- 
portant role in graphene edge expansion (/8, 19). 

To identify the underlying growth mechanisms 
during the C-attachment process at both edges, 
we acquired STM movies with high spatial and 
temporal resolution and long enough for a sta- 
tistically relevant analysis. 

Series of consecutive images along z [movie 
S1 (20)] and k (movie S2) edges were acquired 
with frame rates of 36.5 and 60 Hz, respectively. 
The use of different frame rates (from 4 to 10 Hz) 
and tunneling current setpoints (from 500 pA 
to 10 nA) did not yield substantial differences, in- 
dicating a negligible effect of the fast scanning 
tip on the observed processes. This consistency 
presumably resulted from the short tip-surface 
interaction time (typically ~1.5 to 10 us/ pixel) 
and from the thermal excitation that at elevated 
temperatures likely overwhelmed tip-induced 
effects (21). 

Movie S1 initially shows the formation at the 
z edge of two kink sites, acting as nucleation 
centers for graphene growth (fig. S1). In the im- 
age sequence in Fig. 1B, growth proceeds from 
the kink site in the middle by orderly completion 
of the two interrupted carbon rows while moving 
the kink ahead. During the z edge growth, the 
observed structure of the kink remains equivalent 
to the initial one, probably because of the pref- 
erence for fcec-hollow site termination (15). A 
similar evolution was predicted by previous DFT 
calculations for the z edge of EG on Cu(111), 
where the growth would start from the hexagon 
forming the kink, and continuous incorporation 
of C atoms would form a new z chain on the 
graphene growth front (19). The image sequence 
in Fig. 1C, extracted from movie S2, shows the 
growth mechanism at the k edge, similar to the 
one described above for the z edge, i.e., proceed- 
ing from the kink site and parallel to the edge. 
Our DFT calculations (fig. S2) support this obser- 


Patera et al., Science 359, 1243-1246 (2018) 


vation, showing that the incorporation of new 
C atoms takes place at the kink and that, after 
addition of a first C atom in a top site, the ener- 
getics favor the addition of a second C atom in 
a fec-hollow site. 

A closer examination of both sequences in 
Fig. 1, B and C, reveals the presence of bright ob- 
jects at the kink sites. These features appeared 
and disappeared: They were present in some 
frames (at 27 and 82 ms in Fig. 1B, and at 17 and 
50 ms in Fig. 1C) and were absent in others (at 0 
and 55 ms in Fig. 1B, and at 0 and 33 ms in Fig. 


. ; <1s@ 


1C). They were typically imaged for few scan- 
lines only, indicating a short residence time of 
the originating species (at the millisecond time 
scale; see supplementary text), and their appear- 
ance did not depend on scanning parameters and 
tip conditions. Regarding the nature of the imaged 
species, one possibility would be that they are 
mobile C clusters, as described for graphene 
growth on Ru(0001) and Rh(111) (22, 23). How- 
ever, for Rh(111), where STM images have been 
acquired, the clusters have a height similar to 
that of the graphene layer, at variance with what 


Reseteses 


Fig. 2. Nickel adatoms at the graphene edges. (A) Ni adatom diffusing on a surface in a region 
delimited by graphene z (right) and k (left) edges with kinks. Two representative trajectories obtained 
by MD simulations with ReaxFF performed at 710 K for 100 ps are shown. Color palette for Ni 
trajectories: from green (initial position) to white (final position). The final steps are highlighted by 
increasing the ball size. (B to 1) Short-lived configurations of Ni adatom at k-edge kinks: (B and F) 
High-speed STM images from movie S2 (20) (V = 20 mV; 1 = 7 nA); (C and G) Laplace-filtered version 
of images from (B) and (F) with superimposed ball models; (D and H) constant-height STM 
simulated images based on the calculated geometries (E and |). 
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2.46 eV 


0" step 
+1 C (top) 


Fig. 3. Alternative graphene growth pathways as predicted by 
DFT. Energy zero corresponds to the bare kink configuration (0), plus 
one Ni adatom on the surface far from the edge and a reservoir of 


we observe. Apparent height and lateral dimen- 
sions of the mobile species are, on the contrary, 
comparable to those of the static pointlike de- 
fects observed after CVD graphene growth on 
Ni(111) in the same temperature range, which 
were previously identified as substitutional Ni 
atoms trapped in the graphene network during 
the growth process (13). We therefore attribute 
the bright features in Fig. 1, B and C, to mobile 
Ni adatoms, in line with previous DFT-based 
prediction (24). This hypothesis is supported 
by the well-known presence of diffusing adatoms 
on metal surfaces at elevated temperatures (25). 
Other dashed features with lower apparent height 
(comparable to or lower than that of carbon atoms 
in graphene) were visible in some frames and 
can be attributed either to Ni imaged by the tip in 
an off-center position (26), or to moving C atoms. 
A further confirmation of our identification of 
the bright features in the STM images as Ni 
adatoms comes from molecular dynamics (MD) 
simulations with state-of-the-art reactive force 
field (ReaxFF) performed at 710 K [Fig. 2A; de- 
tails in (20)]. Mobile Ni adatoms moved randomly 
over the bare metal surface until they reached 
a graphene edge. There, they diffused parallel to 
the edge, with considerably longer residence time 
in the kink sites. 

A notable feature in the STM image series is 
that the presence of the Ni adatoms at the 
kinks is in most of the cases accompanied by C 
dimer attachment nearby (see frame at 55 ms in 
Fig. 1B, and frames at 33 and 50 ms in Fig. 10), 
suggesting a catalytic role of the single metal 
atom. This observation is consistent with pre- 
vious DFT calculations for graphene on Cu(111) 
(19), which showed that the barrier for C in- 
corporation at the z edge was strongly reduced 
by Cu adatom passivation at armchair-like sites, 
present near a hexagon (a kink) sticking out of 
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without Ni adatom 


1° step 
(1) ———> (2) 


| +1 C (fcc) (1) 


2" step 


+1Ni 


respectively. 


the edge. Our interpretation is also in line with 
recent in situ transmission electron microscopy 
reports of the chemical activity of single atoms 
in facilitating addition and removal of C atoms 
from the edges of suspended graphene flakes 
(27). These previous results, although notable, 
are not related to a real growth process, but rather 
are related to attachment-detachment events of 
individual C atoms, and thus cannot be directly 
extended to technologically relevant CVD growth 
processes on solid metal surfaces. 

That the Ni adatoms were almost exclusively 
imaged at the kink sites suggests that they rep- 
resent a minimum-energy configuration for Ni; 
nevertheless, the short residence time implies 
that the bond can be easily broken at these tem- 
peratures, avoiding the poisoning of the active 
kink site. No stable Ni trapping was observed in 
our time series, at variance with the defect for- 
mation pathway predicted by Wang et al. (24). 

From the STM sequence corresponding to 
the k edge, two short-lived Ni adatom config- 
urations can be identified at the kink sites (Fig. 2, 
B, C, F, and G). DFT simulations of these con- 
figurations yielded the two stable structures 
shown in Fig. 2, E and I, where a Ni adatom and 
one or two top C atoms were added to the bare 
kink. The corresponding constant-height simu- 
lated images in Fig. 2, D and H, match the ex- 
perimental STM data. DFT calculations indicate 
a pronounced mutual stabilization effect be- 
tween C and Ni adatoms at the kink (from the 
energy comparison of configurations in figs. 
$2 and S3). These results suggest that Ni had a 
strong promoting role in the addition of C atoms 
to the kink and hence in the edge growth, as 
previously predicted for Cu(111) (19). Similar con- 
clusions could be drawn for the z edge, where a 
single intermediate state was imaged and de- 
scribed by DFT calculations (fig. $4). 
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————— 
+1 C (top) 


with Ni adatom 


2) —————> (3) | | 
+1 C (top) 1° step 2" step 


(2') (3') 


——_———_——_> 
+1 C (fcc) 


sub-fcc C atoms far from the edge. For clarity, the energy scale is 
reset for each cycle, producing the blue and red energy scales, 


The stable configurations observed by STM 
and confirmed by theory can serve as a guide for 
a DFT description of the entire growth process, 
discriminating among many possible alternative 
paths and highlighting the catalytic role of the 
Ni adatom. In Fig. 3, we report the energy di- 
agram corresponding to selected C-attachment 
pathways to the k edge with and without the 
presence of the Ni adatom. The energy gain upon 
addition of a C atom is referred to a subsurface 
position far from the edge [“sub-fcc,” which is 
the most stable position on a clean Ni(111) sur- 
face]; for the addition of a Ni adatom, to a po- 
sition on the surface far from the edge. 

We take as initial configuration [labeled (0) 
in Fig. 3] a k edge with all terminating C atoms 
in fec-hollow. The addition of a first C atom on 
the top site (1) involves a considerable energy 
barrier, 2.46 eV. The growth process can then fol- 
low two alternative cycles, either without (blue 
box) or with (red box) a Ni adatom. 

Considering the process without a Ni adatom, 
the attachment of the second C atom in a fcc- 
hollow position (2) requires overcoming a bar- 
rier of 1.02 eV (see fig. S5 for details on energy 
barriers). Because of the epitaxial match between 
graphene and Ni(111), the edge configuration is 
now equivalent to the initial one. Further addi- 
tion of a C top atom (3) requires again overcoming 
the 2.46-eV barrier, completing the cycle that can 
be repeated in a continuous loop, but always 
including this high energy barrier. 

In the alternative path, a Ni adatom attaches 
without an energy barrier to the kink decorated 
by the top C atom (1), yielding an energy gain of 
1.36 eV (1'). The most favorable configuration 
for attachment of a second C atom is again a 
top site, with the Ni adatom diffusing laterally, 
bonding to the two top C atoms (2’). In the last 
step of the growth cycle, another C atom attaches 
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in the fec-hollow position, extending the graphene 
edge and pushing aside the Ni adatom (3'). The 
energy barriers involved in the two steps of this 
cycle are 1.61 and 1.19 eV, respectively. Ni ad- 
atoms, spontaneously binding to the kink site, 
thus reduce the rate-limiting energy barrier of 
the cyclic process by about 35% (from 2.46 to 
1.61 eV), acting as single-atom catalysts for the 
graphene growth process. 

Because the Ni attachment is barrierless, the 
detachment barriers from the two stable states 
at the kink are 1.36 and 1.35 eV, obtained as the 
DFT energy differences of the corresponding equi- 
librium states, i.e., configurations (1') and (2’) of 
Fig. 3, and configurations (1) in Fig. 3 and (e) in 
fig. S2, respectively. These values yield Ni resi- 
dence time values on the millisecond scale, com- 
patible with the values extracted from the STM 
data (fig. S6). 
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ENZYMOLOGY 


Organometallic and radical 
intermediates reveal mechanism 
of diphthamide biosynthesis 


Min Dong,’ Venkatesan Kathiresan,” Michael K. Fenwick,’ Andrew T. Torelli,’ 
Yang Zhang,’ Jonathan D. Caranto,’ Boris Dzikovski,’ Ajay Sharma,” 
Kyle M. Lancaster,’ Jack H. Freed,’ Steven E. Ealick,’* 


Brian M. Hoffman,”* Hening Lin”?* 


Diphthamide biosynthesis involves a carbon-carbon bond-forming reaction catalyzed 
by a radical S-adenosylmethionine (SAM) enzyme that cleaves a carbon-sulfur (C—S) 
bond in SAM to generate a 3-amino-3-carboxypropyl (ACP) radical. Using rapid freezing, 
we have captured an organometallic intermediate with an iron-carbon (Fe—-C) bond 
between ACP and the enzyme’s [4Fe-4S] cluster. In the presence of the substrate 
protein, elongation factor 2, this intermediate converts to an organic radical, formed 
by addition of the ACP radical to a histidine side chain. Crystal structures of archaeal 
diphthamide biosynthetic radical SAM enzymes reveal that the carbon of the SAM C-S 
bond being cleaved is positioned near the unique cluster Fe, able to react with the 
cluster. Our results explain how selective C-S bond cleavage is achieved in this 


radical SAM enzyme. 


iphthamide is a posttranslationally modi- 
fied histidine residue on archaeal and eu- 
karyotic translation elongation factor 2 
(EF2), a protein essential for ribosomal 
protein synthesis (7-3). Diphthamide bio- 
synthesis involves at least four steps and seven 
proteins (4). The first step of the biosynthesis is 
the transfer of a 3-amino-3-carboxypropyl (ACP) 
group from S-adenosyl-L-methionine (SAM) to 
a histidine residue of EF2 (Fig. 1). The enzyme 
that performs this reaction is a Dph2 (diphtha- 
mide biosynthesis protein 2) homodimer in ar- 
chaea, such as Pyrococcus horikoshii (PhDph2) 
(5) or a Dphi-Dph2 heterodimer in eukaryotes 
(6). Dph2 binds an essential [4Fe-4S] cluster and 
is thought to use the reduced (1+) state of the 
cluster to cleave the C,ei-S bond of SAM to 
generate a 3-amino-3-carboxypropyl (ACP) radical. 
The formation of the ACP radical is supported by 
the PhDph2-catalyzed generation of 2-aminobutyric 
acid and homocysteine sulfinic acid in the absence 
of the substrate protein, PhREF2 (5). Additional 
support for a radical mechanism was provided 
by the reaction with a carboxyallyl SAM analog, 
SAM ca (7). However, no radical intermediate has 
been directly observed in the reaction with SAM 
itself, and the detailed reaction mechanism re- 
mains unknown. 
Despite using the same components, a [4Fe-4S] 
cluster and SAM, PhDph2 is structurally unrelated 
to the much larger family of 5'-deoxyadenosine 
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radical (5’-dA+) forming radical SAM (RS) en- 
zymes. In those enzymes, the C5: ,ae-S bond of 
SAM is cleaved, generating a 5’-dAe that then 
initiates downstream reactions (Fig. 1) (8). The 
question of how the PhDph2 homodimer or eu- 
karyotic Dphi-Dph2 bind SAM and achieve a dif- 
ferent SAM cleavage pattern remains unanswered. 
Here, we report biochemical and spectroscopic 
studies of two kinetically competent intermedi- 
ates that, together with x-ray 
crystal structures of archaeal 
Dph2 homodimers in complex 
with SAM, allow us to propose 
areaction mechanism for this 
class of RS enzymes. 

We used rapid freeze-quench 
(RFQ) to arrest reaction mix- 
tures, which were then ana- 


lyzed by electron paramagnetic & 


resonance (EPR) and electron 
nuclear double-resonance 
(ENDOR) spectroscopies for ie} 


the optimum temperature of 
PhDph2 is above 60°C, we 

opted to use the yeast Dph1- 

Dph2 system, which is active 

at room temperature and thus N 
more convenient for RFQ ex- La 
periments. A solution con- 

taining Dph1-Dph2 and SAM N 
was mixed with dithionite I 
and freeze-quenched at 500 ms 8) 
to 4 min. Each sample dis- 

played a new EPR signal, with oO 
g-parallel value (g})) = 2.036, 
g-perpendicular value (g,)= 


—, an 
radical intermediates. Because a ° : N a 


a maximum at 2 s quench time and decreased at 
longer delays (Fig. 2A). This species is not an 
ACP radical, which would not show such a high 
g shift and would exhibit resolvable 'H hyper- 
fine splittings from the methylene radical protons; 
nor is it an S-based radical, which could exhibit 
such g values but again would give resolved hyper- 
fine 'H splittings. Based on these observations and 
experience from previous work using SAMca (7), 
we interpreted this new species as having an 
iron-sulfur cluster-based organometallic structure. 
Importantly, the EPR spectrum and g values of 
this species resemble those of the organometallic 
intermediate that was detected in the RS enzyme, 
pyruvate formate lyase-activating enzyme (PFL-AE) 
and assigned to an Fe-5'-C bond between the de- 
oxyadenosyl group and the unique cluster iron 
(9). As confirmation, when this intermediate was 
generated with *’Fe-enriched enzyme, its EPR spec- 
trum exhibited °’Fe-hyperfine line broadening (fig. 
Sl). Thus, the reaction catalyzed by Dph1-Dph2 
produces an organometallic intermediate. Its g 
values follow the pattern of a [4Fe-4S]** cluster, 
&\| > 8: > 2 CO), where g;, is sensitive to coordination 
at the unique Fe, suggesting a possible formal 
description of the intermediate as containing a 
carbanion bound to an oxidized cluster. 

To establish the structure of this interme- 
diate, we collected RFQ ENDOR samples prepared 
with (methionine-“C,)-SAM, in which the methi- 
onine carbons of SAM were uniformly labeled 
with “C. Field-modulated continuous wave (CW) 
ENDOR spectra obtained at 2 K of this sample 
exhibits a °C doublet, with a °C hyperfine cou- 
pling constant of A ~ 7.8 MHz (Fig. 3A). This 
coupling constant is comparable to Aj, = 9.4. MHz 
observed for the Fe-[5’-"°C]-deoxyadenosyl bond 


Methionine N 


OH OH 


substrate (X-H) 


OH OH 


2.005 at 12 K (fig. S1). The in- 
tensity of this signal reached 
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Fig. 1. PhDph2 and 5'-dA*—forming radical SAM enzyme 
catalyzed reactions. The red asterisk labels Cy met of SAM. 
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Fig. 2. X-band CW EPR spectra of 

RFQ samples showing the formation 
of the organometallic intermediate | 
and organic radical intermediate II. B/G 
(magnetic field/Gauss). (A) Reaction of 
reduced Dphl1-Dph2 with SAM quenched 
at various time points (T = 35 K). 

(B) Reaction of reduced Dph1-Dph2 
with SAM and EF2 quenched at various 
time points (T = 35 K). (C) Reaction 

of reduced Dphl-Dph2 with SAM and 
wild-type EF2 quenched at 2 s and 

2 min (T = 70 K). (D) Reaction of 
reduced Dphl-Dph2 with SAM and 

EF2 H699A mutant quenched at 2 s 
and 2 min (T = 70 kK). In early-time 
spectra, a variable contribution at 

g = 2 from an unidentified organic radical 
enhances the intensity of the g, feature 
of intermediate | relative to that at gj) 

and distorts its shape; fig. S1 presents 
spectra with little of the radical and 

thus a more faithful lineshape of 
intermediate |. 


in the PFL-AE intermediate and together with 
the °’Fe hyperfine broadening provides strong 
support that this intermediate contains an Fe-C 
bond. However, in this case, the Fe-bonded car- 
bon must originate from the “C-labeled methi- 
onine, presumably the C, met. 

In testing whether this organometallic spe- 
cies (denoted intermediate I) was an active reac- 
tion intermediate, we collected a RFQ EPR time 
course in the presence of the substrate protein, 
EF2. Intermediate I accumulated in the first 10 s 
(Fig. 2B). At later time points, the intermediate I 
signal diminished concomitantly with the appear- 
ance of a doublet radical signal centered at g = 2 
(Fig. 2B). This new signal, denoted intermediate 
II, reached a maximum at 2 min quench time and 
decreased at 4 min (Fig. 2B) in a process that 
ultimately leads to product formation. 

The intermediate II EPR signal was distorted 
by saturation effect at 35 K (Fig. 2B) but was 
readily observed at 70 K (Fig. 2C), where spin 
relaxation is faster. Tentative assignment of the 
doublet splitting to coupling to a single proton 
with a large hyperfine splitting (4 ~ 120 MHz) 
led us to hypothesize that this intermediate is 
an organic radical generated when the organo- 
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A g-value 
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metallic intermediate I reacted with the enzy- 
matic target, histidine 699 (H699) of yeast EF2. 
To test this, we repeated the RFQ experiment 
with the EF2 His®° to alanine mutant (H699A). 
With this EF2 mutant, only the organometallic 
intermediate I was detected at both 2 s and 2 min 
(Fig. 2D), and no intermediate II signal was 
detected. This experiment provides strong sup- 
port that intermediate I is chemically competent. 

To test the proposed structure, intermediate II 
was RFQ-trapped with (methionine-"°C,;)-SAM. 
The resulting X-band EPR spectrum showed 
additional splitting of Ajso(1°C) = 59 MHz and 
broadening compared with that of the natural- 
abundance SAM (Fig. 3B), demonstrating that 
the radical incorporates the ACP group of SAM. 
The EPR-resolved doublet, A('H) ~ 120 MHz, 
observed for intermediate II collapsed to a sin- 
glet when intermediate II was prepared with 
°H,-His-labeled EF2 protein (Fig. 3B), confirm- 
ing that the splitting observed with the natural- 
abundance EF2 was associated with 'H of the 
histidine residue on EF2. This was further con- 
firmed by ‘H ENDOR of intermediate II with EF2, 
which exhibited a nonexchangeable 'H signal 
with the same coupling (fig. $2); an exchange- 
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1 1 1 1 i 1 
3100 3200 3300 3400 3500 3600 
B/G 


able coupling, ACH) ~ 15 MHz not resolved in the 
EPR spectrum was also seen. These results con- 
firm that intermediate II is a radical produced 
by the reaction of an ACP radical from inter- 
mediate I with the imidazole ring of His®®® of 
the EF2 substrate. 

Given the structure of the final diphthamide 
product, a candidate for the structure of inter- 
mediate II is an ACP-modified histidine radical, 
as shown in Fig. 4. The shape of the 'H doublet 
in the EPR spectrum (Fig. 3B) and the strongly 
coupled ‘H ENDOR signal (fig. S2) suggest that 
the proton coupling is essentially isotropic, as 
expected for a proton f to the spin in the N-2p-n 
orbital (Fig. 4 and fig. $3) (11); the resolved °C 
splitting in the EPR spectrum of intermediate II 
(Fig. 3B) prepared with (methionine-C;)-SAM 
must come from the carbon of the ACP fragment 
bound to the His, again B to the spin site. The in- 
dividual peaks of the 'H doublet are broad enough 
to contain unresolved “N splitting introduced 
by the spin density on the “N6é of His (Fig. 4). As 
shown in fig. S4, the EPR spectrum of the natural- 
abundance radical is well simulated with a strong, 
nearly isotropic 'H coupling (A, = Ay = 122 MHz; 
Az = 103 MHz) and a highly anisotropic ““N 


2 of 4 


810Z ‘G} Yyoueyy UO /G10' Bewisoual0se0ue!0S//:d}]y Wod| pepeojuMOGg 


RESEARCH | REPORT 


Fig. 3. EPR and ENDOR spectra of 
isotope-labeled intermediates 

and crystal structures of 
SAM-bound CmnDphz2. 

(A) 35 GHz CW 8C ENDOR 

spectra for intermediate | with 
(methionine-8Cs)-SAM. (B) Isotope 
EPR study of the organic radical 
intermediate Il on EF2 quenched at 

2 min (T = 70 kK). Intensity adjusted 
for comparison. (©) Cluster and bound 
SAM in the structure of CmnDph2 
(left), in comparison with cluster and 
bound SAM in the structure of PFL-AE 
(right, PDB 3CB8) (13). Distances are 
given in angstroms. The black and 
brown asterisks label Cy met of 

SAM and the differentiated iron 

of the [4Fe-4S] cluster, respectively. 
(D) Overlay of SAM and cluster 

in CmnDph2 and PFL-AE structures. 
(E) CmnDph2 active site showing 

the binding of SAM. Hydrophobic 
interactions with purine ring of 

SAM are not shown for clarity. 


hyperfine coupling (4; = A, = 4 MHz, Az = 54 MHz). 
5N features seen in the ENDOR of the “N-His 
EF2 sample (fig. S5) likely are a superposition of 
signals associated with the A,/A, components of 
this histidine nitrogen (assigned as N6 in Fig. 4), 
and those from the less strongly coupled His Ne. 
Finally, weakly coupled *C ENDOR signals seen 
when intermediate II was prepared with “C-His 
EF2 (figs. S5 and S6) are assigned to His carbons 
bonded to the spin-bearing N6 (Fig. 4) and ac- 
count for the slight broadening in the compo- 
nents of the 'H doublet in this isotopologue; an 
exchangeable 'H seen in ENDOR (fig. S2) is 
thought to reside on Ne of His. 

The time scale for intermediate II formation 
measured by EPR tracked that of intermediate I 
decay, whereas the reaction product, ACP-modified 
EF2 as quantified by mass spectrometry, accumu- 
lates on the same time scale as the intermediate II 
decay (fig. S7). These results indicate that both 
intermediate I and intermediate II are formed in 
a kinetically competent fashion. Thus, we have 
trapped two sequential intermediates, an organo- 
metallic intermediate I, which gives rise to an 
organic radical intermediate II, which goes on to 
complete the first step of diphthamide biosynthesis. 
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A (Methionine-“C,)-SAM 
7. /—Femez MHz 
* a 0) ie 
Cc 


SAM/Dph2 


SAM/PFL-AE 


To gain further understanding into the reac- 
tion mechanism, we obtained crystal structures 
of the RS enzymes involved in diphthamide 
biosynthesis in complex with SAM. A 2.3-A res- 
olution structure of [4Fe-4S]?*-loaded PhDph2 
in complex with SAM showed some evidence of 
SAM cleavage. A 2.1-A resolution structure of 
Candidatus Methanoperedens nitroreducens Dph2 
(CmnDph2) showed mostly uncleaved SAM. Both 
structures showed average N-Fe and O-Fe dis- 
tances of 2.3 and 3.1A, respectively (Fig. 3C and 
fig. S8), suggesting that, similar to 5’-dAe-forming 
RS enzymes, the amino and carboxylate groups of 
SAM coordinate the unique Fe of the [4Fe-4S] 
cluster in Dph2. This is consistent with our recent 
study showing that both the amino and carbox- 
ylate of SAM are important for the PhDph2 and 
Dphi-Dph2-catalyzed reactions (12). 

However, SAM in both PhDph2 and CmnDph2 
binds with a distinct geometry compared with 
that in 5'-dAe-forming RS enzymes such as PFL- 
AE (13) (Fig. 3D). We further obtained addition- 
al crystal structures using both PhDph2 and 
CmnDph2 that support the geometry shown in 
Fig. 3, C and D (see figs. S9 to S11). The SAM 
binding site of CnnDph2 is shown in Fig. 3E. 
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Gln?’ forms hydrogen bonds with N6 and N7 
of the adenine ring and the amide nitrogen of 
Val?® hydrogen bonds to NI. The adenine ring is 
sandwiched between Phe” and Ile”*°. Asp”*° and 
Asp”°° form hydrogen bonds with the hydroxyl 
groups of the ribosyl moiety, whereas Arg”®?, 
His'®°, and Gly”® form hydrogen bonds with 
the carboxylate of SAM. 

Most strikingly, the C, ret distance to the unique 
Fe, as averaged over the PhDph2 and CmnDph2 
structures, is 3.7 A, which is closer than Syye to the 
Fe (the S,y.¢-Fe distance is 4.6 A). In contrast, in 
the PFL-AE structure, the Syrer is closer to the 
unique Fe than Cs. age, with an Sy\et-Fe distance 
of 3.2 A (Fig. 3C). The observation that in PhDph2 
and CmnDph2, Cy yer was closer than Syye to the 
unique Fe suggests that during the Cy met-Smet 
bond cleavage, the electron is transferred from 
the Fe-S cluster via Cy met, whereas for PFL-AE 
(and generally believed for all 5’-dAe-forming 
RS enzymes), the electron is transferred from 
the Fe-S cluster via Syyet (8). 

Based on the two intermediates that we have 
detected and the structures of PhDph2 and 
CmnDph2 in complex with SAM, we propose 
a reaction mechanism for the first step of 
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Fig. 4. The proposed reaction mechanism of diphthamide biosynthetic RS enzymes. 


diphthamide biosynthesis (Fig. 4). The unique 
iron of a [4Fe-4S]* cluster in an archaeal Dph2 ho- 
modimer or eukaryotic Dph1-Dph2 heterodimer 
attacks the y carbon of the methionine in SAM, 
generating a 3-amino-3-carboxypropyl-[4Fe-4S]}** 
organometallic intermediate I and 5’-methyl- 
thioadenosine (MTA). The organometallic inter- 
mediate I is essentially a stabilized ACP radical. 
In the presence of the substrate EF2, the Fe-C 
bond can break homolytically, allowing the ACP 
radical to react with the imidazole ring of the 
histidine residue of EF2 and generate the or- 
ganic radical, intermediate II. Intermediate II 
then loses a proton and an electron to form an 
ACP-modified histidine. 

In the study of PFL-AE, it was noted that the 
Fe-[5’-C]-deoxyadenosyl organometallic interme- 
diate could be generated through a radical reac- 
tion or nucleophilic reaction (9). In our case, the 
formation of intermediate I could occur via homo- 
lytic reductive cleavage of SAM to form an ACP 
radical, which then reacts with the unique Fe 
(stepwise one-electron transfer) to form the or- 
ganometallic intermediate I. Alternatively, a con- 
certed one-step, two-electron transfer (nucleophilic 
attack on C, yet) can form intermediate I directly. 
The fact that Cy met was only 3.7 A away from the 
unique Fe, whereas the sulfonium moiety was 
4.6 A away (Fig. 3C), favors the latter mechanism, 
although we cannot eliminate the homolytic 
cleavage mechanism. 

The structure also provides a simple solution to 
the question, how does the archaeal Dph2 homo- 
dimer or eukaryotic Dphl-Dph2 heterodimer 
cleave a different C-S bond in SAM compared 
with all 5'-dA*-forming RS enzymes? If the cluster 
interacts with and transfers electrons to Cy wet 
instead of Syet, the only C-S bond that can be 
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broken in SAM is the C, yyer-S bond. Previously, 
Kampmeier proposed that a stereo-electronic con- 
trol mechanism could explain the different cleav- 
age patterns (/4). This model rationalizes the C-S 
bond cleavage in RS enzymes using a radical dis- 
placement reaction, with the formation of an Fe-S 
bond accompanying the cleavage of the C-S bond 
(fig. S12). This radical displacement reaction also 
requires that the C-S-Fe atoms involved in the 
release of a radical on carbon are arranged roughly 
colinearly. Thus, if SAM in archaeal Dph2 homo- 
dimer or eukaryotic Dph1-Dph2 is bound in a con- 
formation with Syer Close to the Fe-S cluster and 
colinear C, yye-S-Fe arrangement, it would lead to 
the cleavage of C, me:-S. However, our structure 
indicates that in archaeal Dph2 homodimer, the 
C, met is close to the unique Fe. The Cy ye; me- 
diates electron transfer from the Fe-S cluster, 
dictating which bond is cleaved. This is an im- 
portant difference between diphthamide bio- 
synthetic RS enzymes and 5’-dAe-forming RS 
enzymes. Why does nature choose this way to 
control the bond cleavage in diphthamide bio- 
synthetic RS enzymes? Perhaps conformational 
constraints on SAM prevent the C, me:-S-Fe co- 
linear arrangement proposed by Kampmeier, 
whereas the roughly colinear S-C, wei-Fe ar- 
rangement as revealed by the structures is feasible 
(fig. S12). 

In summary, this study has explained how 
diphthamide biosynthetic RS enzymes create the 
EF2 product through cleavage of the C,er-S 
bond of SAM in a process that involves first an 
organometallic intermediate, then a radical inter- 
mediate that collapses to product. Together with 
the recent study showing an organometallic inter- 
mediate in a 5’-dAe-forming RS enzyme PFL-AE, 
these studies further suggest that an organo- 
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metallic intermediate may serve as a stabilized 
form of the highly reactive primary organic radical 
and provide a strong parallel between RS enzymes 
and adenosylcobalamin-dependent enzymes. 
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INDUCED SEISMICITY 


Oklahoma’s induced seismicity 
strongly linked to wastewater 


injection depth 


Thea Hincks,”? Willy Aspinall,’?*? Roger Cooke,*> Thomas Gernon®* 


The sharp rise in Oklahoma seismicity since 2009 is due to wastewater injection. The role of 
injection depth is an open, complex issue, yet critical for hazard assessment and regulation. We 
developed an advanced Bayesian network to model joint conditional dependencies between 
spatial, operational, and seismicity parameters. We found that injection depth relative to 
crystalline basement most strongly correlates with seismic moment release. The joint effects 
of depth and volume are critical, as injection rate becomes more influential near the basement 
interface. Restricting injection depths to 200 to 500 meters above basement could reduce 
annual seismic moment release by a factor of 1.4 to 2.8. Our approach enables identification of 
subregions where targeted regulation may mitigate effects of induced earthquakes, aiding 
operators and regulators in wastewater disposal regions. 


klahoma’s ~900-fold increase in the an- 

nual rate of seismicity since 2009 (1) (fig. S1) 

makes it the most seismically active region 

in the contiguous United States. Wastewater 

disposal via deep injection wells explains 
the surge in earthquake activity (/-7). In many 
cases, pressurized fluids migrate from target 
formations downward into the crystalline base- 
ment (J, 7-10), where most earthquakes occur 
(11-18). The disposed fluids may locally increase 
pore fluid pressure and reduce the effective 
stress along faults (2, 9, 17), resulting in induced 
seismicity (2, 79). In September 2016, a magni- 
tude (M,,) 5.8 earthquake centered near Pawnee 
caused injury and damage to buildings, prompt- 
ing the immediate suspension of 37 disposal 
wells (20). The number of damaging earthquakes 
(5, 6, 21) resulted in litigation against well oper- 
ators (22), increased risk to critical infrastructure 
(23), and considerable financial loss in the re- 
insurance industry (24). 

The escalating seismicity compelled Oklahoma 
regulators to introduce emergency directives 
aimed at managing injection (25), informed by 
an empirical understanding of how seismicity 
responds to changes in injection rates or vol- 
umes (2, 4, 6). These measures appear to have 
decreased the earthquake count (,, = 3) in the 
period 2015 (27) to the present (fig. S1). Con- 
sidering the purported strong association be- 
tween seismicity and oil production in Oklahoma 
(26), a major driver of this trend could have been 
abated injection due to relatively low oil prices. 
However, a case has been made (27) that the 
probabilities of occurrence of larger-magnitude 
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earthquakes are not decreasing as rapidly as other 
models suggest (1). The total seismic moment 
release has declined only modestly despite de- 
creased injection volumes (fig. $1). The ultimate 
impact of reduced injection may take years to ma- 
terialize given observed lagging in the response 
of seismicity levels to injection (JJ, 27, 28). 

The Oklahoma Corporation Commission (OCC) 
requires operators to prove that their injection 
does not encounter basement lithologies (29), 
but debate remains on the precise influence of 
injection depth. Although a study encompass- 
ing the wider Central and Eastern United States 
(CEUS) found no significant correlation between 
proximity to basement and earthquake occur- 
rence (4), other studies suggest that injections 
near, or within, crystalline basement might in- 
crease the likelihood of seismicity locally (12, 30, 37). 
Therefore, we need to analyze the joint effects 
of spatial, operational, geologic, and seismicity 
parameters (4, 19, 21). 

We developed a Bayesian network (BN) to 
quantitatively evaluate correlations between 
well operational parameters, geologic setting, 
and seismicity, focusing on the “Oklahoma in- 
duced seismicity zone” (5) (Fig. 1). Our BN frame- 
work capitalizes on mathematical advances (32) 
and innovative software, UNINET (33), imple- 
mented in R (34). UNINET models joint de- 
pendency using continuous data distributions, 
bypassing the limiting assumptions of discrete 
approximations (32). Monthly underground in- 
jection control (UIC) well data for January 2011 
to September 2016 were obtained from OCC (35), 
including injected volume, pressure, duration of 
well activity, and total well depth. Earthquake 
location and magnitude data, from the Advanced 
National Seismic Systems’ (ANSS) earthquake 
catalog, ComCat (36), were used to calculate total 
seismic moment release. We considered earth- 
quakes of M,, = 2, with a hypocentral depth 
<10 km (fig. $3), and a hypocenter falling within 
20 km of an injection well (32). We chose this 
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distance on the basis of the documented occur- 
rence of associated earthquake swarms within 
10 km (37) to 20 km (3) of high-rate disposal 
wells, supported by additional calculations that 
indicate strong correlations between injection 
and moment up to ~20 km (32). Basement depths 
at well sites were estimated by spline interpo- 
lation (32, 38) using 1232 depth records from 
Oklahoma Geological Survey (39) (fig. S5). 

Over the study period, >10,000 active class IT 
UIC wells in Oklahoma (Fig. 1, A and B) col- 
lectively injected on average ~2.3 billion barrels 
(bbl) of fluid per year via saltwater disposal (SWD) 
and enhanced oil recovery (EOR) (35) (figs. Sl and 
$2 and table S1). We considered class II injections 
into the full range of target geologic formations, 
to capture all potential contributions to induced 
seismicity (32). Wastewater disposal is particu- 
larly intensive within the Cambrian-Ordovician 
Arbuckle Group, a sedimentologically and struc- 
turally heterogeneous sequence of dolomitized 
carbonates and breccias (40) ~0.15 to 2 km thick 
throughout most of the study area. Sedimentary 
cover is hydraulically connected to Precambrian 
crystalline basement (J, 7-9) (fig. S6) via an ex- 
tensive fault network (14, 17, 41, 42) (Fig. 1C). 

High-rate injection has been linked to in- 
creased seismicity (4). However, analysis using 
BN and regression models indicate that other 
parameters have an important, but largely un- 
quantified, impact on seismic hazard. We ex- 
plored correlations between key operational and 
spatial characteristics and seismicity, comparing 
a number of network configurations with a sim- 
ple linear regression (Fig. 2) to determine the 
most appropriate model (32). We defined nodes 
for key observables, including well operating 
parameters (depth, monthly, annual and cumu- 
lative injected volume), geologic parameters (dis- 
tance to basement), and total seismic moment 
release within 20 km of the well in the year 
ahead (Fig. 2, A and B) (32)—a standard time 
scale for operational and seismic hazard fore- 
casts (5, 6, 21). To improve forecasting for deeper 
and higher-volume wells associated with higher 
seismicity (Fig. 3 and fig. S7) (32), our models 
focused on wells =1 km deep and cases of month- 
ly injection >10,000 bbl, i.e., those of greater 
concern to regulators (20). We used UNINET in 
“data mining” mode to estimate the set of joint 
normal copulae (multivariate probability dis- 
tributions) that most closely represented the 
input data (32). We used the resulting models to 
estimate moment release for any given set of 
operating conditions. 

The saturated BN (Fig. 2A) (32) gave the most 
complete representation of the observed data 
under stationary conditions, outperforming the 
regression model (in-sample testing, table S3 and 
figs. S9 and S10). However, the unsaturated BN 
(Fig. 2B) provided better forecasting under the 
changing injection regime of 2015 to 2016 (fig. S1B). 
This is because rank correlations between volume 
variables in the saturated BN preserved relation- 
ships (associated with increasing injection) that 
were no longer valid under post-2015 operating 
conditions. 
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We used two test data sets for model verifica- 
tion (fig. S11 and table S3). The first demonstrated 
operational use of the BN to forecast future mo- 
ment release, learning with observations from 
January 2011 to June 2015, and forecasting from 
July to December 2015. The second test used 90% 
of the full data set for learning (January 2011 
to December 2015, randomly sampled) and the 
remaining 10% for verification (fig. S11) (32). We 
compared forecast results using the BN and re- 
gression models (Fig. 2, figs. S12 and S13, and 
table S6) and found that in both cases, the BN 
gave the best performance. 

The spatial distribution of earthquakes (Fig. 1, 
B and C, and fig. S2) implies important under- 
lying geologic characteristics influencing seis- 
micity. The BN cannot explicitly represent these 
characteristics because of the variability and pau- 
city of geologic data at relevant depths and re- 
solution across the region. To address this issue, 
we applied the models iteratively to identify where 
(spatially) they systematically under or overfore- 
cast, because of latent features outside the scope 
of the model. We used both BN and regression 
models to estimate seismic moment for a subset 
of the training data and used the forecast error to 
compute a “geospatial correction” map—a proxy 
for unobserved geologic data (32) (fig. S8). We 
repeated the learning step with the training data 
using the geospatial correction as an additional 
input node. We used this revised network (Fig. 2) 
for final forecast verification (figs. S10, S12, and 
$13) (32). 

The geospatial correction shows the strongest 
(negative) correlation (-0.55) with annual mo- 
ment release (Fig. 2C), supporting the obser- 
vation that lithologic (9) and fault network 
characteristics (14, 16-18) collectively play a 
critical role in determining susceptibility to in- 
duced seismicity. Root mean squared errors, 
mean absolute errors, and median average de- 
viation for the BN and regression models, both 
with and without the geospatial correction node, 
show the improvement in forecast skill resulting 
from this additional step (table S6, Test 2). This 
degree of improvement suggests that the geo- 
spatial correction effectively characterizes the 
impact of important, but unobservable (latent), 
factors. It could additionally prove valuable in 
mapping spatial variation in induced earthquake 
hazard (27). 

Our analysis with UNINET shows that dis- 
tance to the basement interface is more strongly 
correlated (-0.34) with annual seismic moment 
release than injected volume; and cumulative 
injected volume has a stronger correlation (+0.26) 
with annual seismic moment release than annual 
(+0.20) or monthly (+0.18) volume (Fig. 2C, em- 
pirical rank correlations). However, we expect 
these volume correlations to change with time 
owing to the effects of regulation, market-driven 
changes in oil production and associated dis- 
posal, and time lags between injection and in- 
duced seismicity. For example, cumulative volume 
will become less well correlated with annual 
moment if the current trend of reduced seismicity 
continues (fig. S1A). 
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UNINET uses joint normal copulae (32,33). 
As several input distributions are highly skewed 
and seismic moment is censored (because of cat- 
alog incompleteness for M,, < 2.5; fig. S4), the 
normal copula cannot perfectly represent the data. 
We see this in the difference between the empirical 
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and BN unconditional rank correlations (table S5) 
and cumulative distribution functions (CDFs; 
Fig. 2 and figs. S12 and S13). However, calcula- 
tions suggest that using optimal copulae would 
yield only a small improvement, negated by 
greatly increased computational complexity (32). 


Seismic moment (Nm) 
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Fig. 1. Wastewater injection and seismicity in Oklahoma. (A). Map showing the surface 
expression of key geologic provinces of Oklahoma (41) and class II UIC wells (35) (related to oil 
and gas production). Black polygon defines the “Oklahoma induced seismicity zone” delineated by 
the USGS (5). (B) Map showing total injected volume (bbl) for 2011 to 2015 (30-km Gaussian 
kernel density). Symbols denote locations of earthquakes from 2011 to 2016, including the 5 
November 2011 Prague (7, 10, 11) and 3 September 2016 Pawnee (18, 21, 28, 44) events. (C) Map 
showing total seismic moment release from 2011 to 2016 (20-km Gaussian kernel density), 

with mapped faults in the sedimentary cover including the Nemaha uplift (arrow) also shown 
(42) [this does not include all large-scale basement structures (17)]. Yearly injection volumes and 


seismic moment release are shown in fig. S2. 
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Fig. 2. BN structure, unconditional rank correlation coefficients, and cumulative distribution 
functions (CDFs) for Test 2. (A) The saturated network and (B) the unsaturated network. Arcs denote 
influence between nodes. (C) Unconditional rank correlation coefficients for all nodes with logi9 annual 
moment release (empirical data and BNs). This correlation is a measure of the strength of influence of 


each observable on the node of interest (moment rel 


ease). The BN uses joint normal copulae to represent 


the empirical data—the closer the values in the BN rank correlation matrix are to the empirical rank 
correlation matrix, the better the model approximates the data (matrix determinants can also be 


compared). Full correlation matrices are provided in 


table S5. (D and E) CDFs of logig annual moment 


release for the BN forecast (mean and median estimate), linear regression model, and empirical data 
for (D) saturated BN and (E) unsaturated BN. This function shows how closely the model output 


distribution matches the observations. Results are s 


hown for Test 2-learning using 90% of the 


observations, testing with the remaining 10% (see fig. S11). 


The asymmetry of the data also means that a 
linear model will not perform as well as the BN 
(e.g., Fig. 2, D and E). 

We demonstrate how the BN could be used to 
quantify the expected impact of UIC regulations 
with three applied examples. We can (i) reduce 
injection depth by a fixed amount [to simulate 
well plug back (29) or optimal depth of a new 
well], (ii) cap injection volume according to im- 
posed regulatory limits (25, 43), and (iii) restrict 
injection depth in the Oklahoma induced seis- 
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micity zone, by enforcing a minimum permissi- 
ble distance above the basement interface. We 
intend these examples as demonstrations of how 
the BN could be used in practice. Before using 
this approach to inform operational or regula- 
tory practices, further detailed simulations would 
be required to account for local-scale suscep- 
tibility to induced seismicity. 

For a subset (10%) of well records (fig. S11), we 
reduced well depth by 1000 m, and the resulting 
“test data” were used to produce a set of revised 
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forecasts for annual moment release. We cal- 
culated the change in mean moment for each in- 
dividual case due to the change in depth (Fig. 4). 
For above-basement wells, raising the well by 
1000 m shifted injection further from the base- 
ment (shallower) and led to a reduction in seismic 
moment release. We found the greatest reduc- 
tion in wells that inject higher volumes (Fig. 4A). 
Conversely, increasing depth to inject closer to 
(or into) the basement led to an increase in 
seismic moment release (fig. S14). 

To model the potential impact of regulating 
volume, we applied a daily limit of 15,000 bbl per 
well, as imposed by Oklahoma regulators (25), 
similar to the Kansas 16,000 bbl/day limit for 
areas of “seismic concern” (43). We applied this 
limit to the full data set and used the revised 
monthly injections to recalculate annual and 
cumulative volumes for each well (32). We then 
used the BN to simulate the revised annual mo- 
ment release for all individual monthly well re- 
cords with reduced injection (Fig. 4B). 

As expected, our simulations showed that wells 
subjected to the largest drop in monthly and 
annual injected volume yielded the greatest re- 
duction in moment release (Fig. 4B). We found 
a small number of exceptions in which large 
changes in mean annual moment were predicted 
for wells with relatively low annual injection 
rates (Fig. 4B). These cases largely corresponded 
to wells injecting close to the basement (fig. S15), 
where more modest changes in injection volume 
can yield proportionally larger decreases in mo- 
ment (Fig. 4). This demonstrates the importance 
of quantifying the joint effect of operational pa- 
rameters (depth, cumulative volume and injec- 
tion rates) on future seismicity levels. 

We performed a final set of simulations to 
quantify the statewide impact of depth restric- 
tion, using depth limits of (i) 200 m and (ii) 500 m 
above basement, wherever this could be practi- 
cally applied. Using well records for December 2015 
to predict annual moment release in the year 
ahead, we found an indicative (mean) reduction 
in total annual moment release for the Oklahoma 
induced seismicity zone of a factor of ~1.4 for the 
200-m limit and a factor of ~2.8 for the 500-m 
limit (32). 

A distinct advantage of the BN is the ability 
to update the model and produce revised fore- 
casts as data become available. For example, 
using newly released (September 2017) OCC 
injection data (35), we tested BN forecasting 
for 2016, first using the model as developed 
above (fig. S16) and second, using an updated 
geospatial correction and moving window for 
learning, which (by partially accounting for 
temporal changes in the system) improves per- 
formance (figs. S17 and S18) (32). The empirical 
and unconditional BN rank correlations (table 
S7) computed with the complete updated data 
set are not considerably different from those for 
2011 to 2015 (table S5), lending credence to the 
basic model. 

Our results indicate that seismic moment re- 
lease in Oklahoma is strongly correlated with in- 
jection proximity to crystalline basement (Figs. 3 
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and 4A and fig. $7), corroborating previous 
hypotheses (30, 37) but contrary to observations 
encompassing the CEUS (4). We attribute this 
disparity to scale dependence (19). In Oklahoma, 
the permeability structure of the Arbuckle Group 
permits downward fluid migration into crystal- 
line basement (J, 9), causing reactivation of op- 


log,, annual 
moment (Nm) 
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Fig. 3. Probability distributions for well 
depth and distance to basement using BN 
inputs for January 2011 to December 2015 
(wells =1 km deep and injection records 
>10,000 bbI/month). Probability density for 
(A) well depth below surface; (B) depth relative 
to basement and (C) distance to basement; iden- 
tifying (i) cases with low or no seismic activity 
(logig moment in 1 year <13; shaded blue) and 

(ii) cases with relatively high seismic activity (logi9 
moment in 1 year =15; shaded pink). The distribu- 
tions in (B) and (C) show that wells drilled closer 
to the basement are associated with higher 
seismic moment release. Distributions for injection 
volume are shown in fig. S7. 
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timally oriented strike-slip faults (14, 16-18).Ona 
continental scale, lithologic and structural heter- 
ogeneity and concomitant stress-field variations 
could obscure any association between depth 
and seismicity, as some lithologies are more sus- 
ceptible to earthquakes (9) (fig. S6); e.g., where 
fracture flow causes localized pore fluid pres- 
sure increases, allowing rupture along critically 
stressed faults (9). Regions of relatively high- 
permeability basement in other parts of the CEUS 
will conceivably permit greater pore fluid diffu- 
sion and pressure release, inhibiting earthquakes 
(9). Induced seismicity is also controlled by local 


hydrogeologic conditions (30) and the location, 
scale, and orientation of critically stressed faults— 
all unique aspects of the geotectonic setting. 
This highlights the importance of local-scale 
assessments. 

Our Oklahoma assessment exploited a 6-year 
record of fluid injection to develop a compre- 
hensive understanding of the controls on in- 
duced seismicity. The two-stage BN quantifies 
the joint effects of operational parameters and 
latent spatial features on seismic moment re- 
lease, facilitates regular model updating, and of- 
fers improved forecast performance compared to 
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Fig. 4. Simulated impact of raising the injection well level or capping monthly injection volume. 
(A) For a subset of records, well depth was reduced by 1 km and used to generate revised estimates 
for annual moment release using the BN (colored points) and linear regression models (black points). 
The plot shows change in annual moment versus original well depth and predicts the greatest 
reduction in moment release for wells initially closest to the basement interface. The joint effect of 
cumulative volume is also seen—wells with higher cumulative injections are expected to experience 
proportionally greater moment reduction. (B) Predicted change in mean annual moment resulting 
from a regulatory limit of 15,000 bbl/day (450,000 bbl/month) versus monthly injection with no cap. 
This plot shows cases of high injection only (where the limit is applied). Points are colored by total 
reduction in annual volume to illustrate the importance of maintaining injection limits over time. Inset: 
CDFs for moment release (BN mean estimates) show the overall effect on all high-volume wells. 
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traditional linear models. This approach has im- 
portant implications for statewide assessments 
and regulatory decision-making under geologic 
uncertainty [note that the Pawnee (16, 18, 44), 
Cushing (44), Prague (11, 14), and Fairview (45) 
earthquakes occurred on previously unrecognized 
faults or splays]. Our framework allows regulatory 
actions to be evaluated on a rational, quantitative 
basis in terms of seismic effects. The model could 
be used to identify evolving vulnerability, comple- 
menting zoned seismic hazard forecasts of the 
U.S. Geological Survey (USGS) (2D, and provide 
a basis for targeted management of induced 
seismicity in Oklahoma and other wastewater 
disposal regions. We welcome the support of 
researchers, operators, regulators, and policy- 
makers in updating and extending our model. 
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FISHERIES 


Protecting marine mammals, 
turtles, and birds by rebuilding 


global fisheries 


Matthew G. Burgess,”?*+ Grant R. McDermott,®"+ Brandon Owashi,»” 
Lindsey E. Peavey Reeves,* Tyler Clavelle,”” Daniel Ovando,” Bryan P. Wallace,”’® 
Rebecca L. Lewison,’ Steven D. Gaines,”” Christopher Costello”” 


Reductions in global fishing pressure are needed to end overfishing of target species 

and maximize the value of fisheries. We ask whether such reductions would also be 
sufficient to protect non-target species threatened as bycatch. We compare changes in 
fishing pressure needed to maximize profits from 4713 target fish stocks—accounting 

for >75% of global catch—to changes in fishing pressure needed to reverse ongoing 
declines of 20 marine mammal, sea turtle, and seabird populations threatened as 
bycatch. We project that maximizing fishery profits would halt or reverse declines of 
approximately half of these threatened populations. Recovering the other populations 
would require substantially greater effort reductions or targeting improvements. Improving 
commercial fishery management could thus yield important collateral benefits for 


threatened bycatch species globally. 


isheries employ 260 million people and 
fish are a primary animal protein source 
for roughly 40% of the world’s population 
(1). Recent studies suggest that more than 
half of the world’s fisheries are overfishing 
(2), and rebuilding these fisheries could increase 
global fishing yields by ~15% and profits by ~80% 
(2, 3). Fisheries also affect many protected, non- 
target species through bycatch (incidental capture), 
including ecologically important and charismatic 
megafauna such as marine mammals, sea turtles, 
seabirds, and sharks (4). Some of these bycatch 
species, such as Mexico’s vaquita porpoise 
(Phocoena sinus) and New Zealand’s Hector’s 
dolphin subspecies (Maui dolphin, Cephalorhynchus 
hectori maui), face imminent extinction (5, 6). For 
these reasons, ending overfishing and protecting 
threatened bycatch species are two of the main 
goals of modern marine conservation efforts. 

At first glance, sustaining high fishery profits 
and yields can seem in conflict with bycatch species 
conservation. Unless targeting can become more 
selective through changing fishing technology or 
practices, reducing bycatch requires reducing tar- 
get stock catch. However, because rebuilding over- 
fished target stocks requires reducing fishing 
effort, bycatch populations should also benefit. 
Indeed, regions with the most severe bycatch— 
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coastal fisheries of the developing world and, 
to a lesser extent, high-seas fisheries (4)—also 
experience some of the most severe overfishing 
(2) (Fig. 1 and fig. $1). 

We quantify the trade-offs globally between 
protecting bycatch species and meeting economic 
fisheries objectives. To do this, we compare esti- 
mates of the changes in fishing pressure needed 
to maximize long-term profits [termed “maxi- 
mum economic yield” (MEY)] for 4713 fish stocks, 
accounting for >75% of global catch (2), to the 
changes in bycatch mortality needed to reverse 
ongoing population declines of 20 populations 
substantially affected by fisheries bycatch, for 
which sufficient published information is avail- 
able to calculate the reductions in mortality 
needed to prevent further declines (materials 
and methods and table S1). 

Our sample includes 9 of 26 marine mammal 
populations, 6 of 8 sea turtle populations or 
species, and 3 of 22 seabird populations that the 
International Union for Conservation of Nature 
(IUCN) identifies as threatened, declining, and 
having bycatch as a primary threat (7). We also 
include the Northwest Atlantic loggerhead turtle 
(Caretta caretta) population, but it is not listed as 
threatened by the IUCN owing to uncertainty as 
to whether it remains in decline (7) (materials 
and methods). The IUCN last assessed olive rid- 
ley turtle (Lepidochelys olivacea) populations 
jointly (7), and we include two of these in our 
analysis (materials and methods). We restrict 
our analysis to marine mammals, sea turtles, 
and seabirds, because they are rarely retained or 
commercially valuable (4). However, future work 
could use similar methods to consider sharks, 
rays, and other taxa retained as both target and 
non-target catch (8). 

Accounting for multiple uncertainties, we ask 
how likely it is that solely managing all target 
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fisheries to MEY would reduce bycatch mortality 
sufficiently to halt each bycatch population’s de- 
cline. We further ask how much long-term profit 
would need to be foregone, or how much more 
selective targeting would need to become, to en- 
sure that each bycatch population’s decline was 
halted. In other words, we assess whether there 
is currently a trade-off between maximizing long- 
term profit and halting each bycatch population’s 
decline, and how severe the trade-off is, if one 
exists. In the supplementary materials, we ex- 
plore trade-offs relative to maximum long-term 
catch [termed “maximum sustainable yield” (MSY)] 
and obtain results similar to those for MEY 
(figs. S2 to S4). 

We assume that each population’s annual rate 
of change (denoted A, e.g., A = -0.05 year? im- 
plies a 5% annual decline in abundance) can 
be approximately expressed as (materials and 
methods) 


A=A, — Fe (1) 


Here, A,, denotes the annual rate of change 
in abundance that would occur if there were no 
bycatch, and F denotes the “effective” annual 
bycatch mortality rate—the fraction of the pop- 
ulation’s total reproductive value removed by 
bycatch annually. Derived from age-structured 


A 
we 
i 0.4 % T 
a 0% 
© ios. 25% 
2a” 50% 
oO 75% 
5 0.24 100% 
£ © Bird 
£ O Cetacean 
£014 © Pinniped 
o A Turtle 
a 

0.05 


04 03 02 01 00 
Rate of population change (A) 


Reduction in fishing pressure (MEY vs. 2010-2012) 
0% 25% 50% 75% 100% 


Fig. 1. Reductions in fishing pressure 

needed to meet profit and bycatch 
objectives. (A) Population decline (A) and 
bycatch mortality (F.) rates, and reductions 

in bycatch mortality needed to halt population 
declines (%T), for 20 bycatch populations. 

(B) Projections (2) of average reductions in 
target stock fishing mortality (weighted by 2010 
to 2012 expenditure), by FAO Major Fishing Area 
(except Arctic Sea, gray), under the MEY 
scenario [same color bar as in (A)]. 
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population models, reproductive value measures 
the relative contributions of individuals in each 
age group to overall population growth [e.g., see 
(9, 10)]. We use this measure to standardize 
bycatch of different ages across fisheries, since 
fisheries primarily causing bycatch of breeding 
adults tend to have much larger population im- 
pacts than fisheries causing bycatch of small juve- 
niles [e.g., (J0)]. To keep the units of Eq. 1 consistent, 
we also measure A and A, in reproductive-value 
units where possible, i.e., where a published age- 
structured assessment is available [e.g., (10)]. 
Otherwise, we assume that abundance and mor- 
tality trends measured in individual units reflect 
trends in reproductive value. 

From Eq. 1, we calculate the percentage (de- 
noted %7') by which each bycatch population’s 
mortality rate, F., would have to decrease to 
halt its population decline (i.e., A = 0), if all other 
mortality sources remained constant: 


Loy, 
0O=A, — (: = aa) (2A) 
%T = 100 ( =) 100( 2) 
A 
= 100 (3 — x) (2B) 


Figure 2 illustrates the steps of our analysis 
for each bycatch population, using the relatively 
data-rich Northwest Atlantic loggerhead turtle 
as an example. Materials and methods and table 
S1 describe our analysis for all populations. First, 
we obtain point estimates and approximate un- 
certainty for two of A, A,, and F,, from the lit- 
erature. From these, we calculate point estimates 
and distributions for %T using Eq. 2B (Figs. 1A 
and 2B and fig. S2). We also use information 
from the literature to infer which target fisheries 
may be contributing to bycatch mortality (Fig. 2, 
A and C). 

We then perform a Monte Carlo simulation 
that defines 1000 different “states of the world.” 
In each state, we randomly draw a value of 
%T from its distribution (Fig. 2B), as well as an 
allocation of bycatch mortality among target 
fisheries from the set of identified target fish- 
eries. We weight allocation probabilities by the 
fisheries’ relative efforts, measured by 2010 to 
2012 fishing expenditures (Fig. 2C). We assume 
that bycatch mortality (F.) responds propor- 
tionally to changes in target stock mortality. 
Thus, the percentage reduction in bycatch mor- 
tality in a given state of the world is equal to 
the average change in sampled target stock fish- 
ing mortality at MEY relative to 2010 to 2012 
rates (2). 

In some states of the world, the projected re- 
duction in bycatch mortality at MEY is greater 
than %T7. The bycatch population’s decline is 
thus already halted under economically optimal 
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conditions and current targeting, implying that 
zero cost or targeting improvement is required. 
In states of the world where the projected re- 
duction in bycatch mortality is less than %7, we 
calculate the total cost of reducing bycatch mor- 
tality by %T according to principles of economic 
efficiency (i.e., additional reductions in target 
stock mortality beyond MEY are ordered in 
ascending order of marginal cost). We calculate 
the required targeting improvement as the addi- 
tional percentage change in bycatch mortality 
required beyond MEY. When %T = 100, fishing 
or bycatch must cease entirely, so the required 
cost or targeting improvement is 100%. Our Monte 
Carlo analysis thus yields distributions of %7 and 
expected reductions in bycatch mortality (Fig. 2D), 
as well as costs (Fig. 2E) and targeting improve- 
ments (Fig. 2F) required to halt the decline of 


each bycatch population. 

In 95% of simulated states of the world, halting 
the declines of 7 to 13 populations (median 10) is 
fully accomplished by managing target stocks to 
MEY, or requires only minor loss in total profit 
(<5%) (Figs. 3 and 4 and figs. S2 and $3). In >50% 
of states of the world, this includes seven turtles, 
one pinniped, one cetacean, and two birds (Fig. 3). 
Required costs are often substantial (>50%) for 
the remaining populations. Even eliminating 
bycatch completely is insufficient to halt declines 
of one turtle and one bird in most states of the 
world, owing to other mortality sources. Targeting 
improvements required for recovery are always 
slightly larger than required profit losses (Figs. 
3 and 4 and fig. S4), because long-term profits 
are insensitive to small deviations from the exact- 
ly optimal fishing pressure [(/7) demonstrates 
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Cost (percent of MEY) 


Targeting requirement 


trated using the Northwest Atlantic 


loggerhead turtle population. From the literature, we determine (A) its geographic range 


(green, nesting sites in purple) (18, 19), (B) the jo 


int distribution of its rates of abundance 


change currently (A) and without bycatch (A,)—from which we sample the mortality reduction 
needed (%T) (dots, gray indicates %T < 0)—and (C) the target stock groups implicated in 
bycatch and their relative contributions to mortality. (C) The effort (measured as average 
2010 to 2012 fishing expenditures) and projected reduction in mortality under MEY in 

each target fishery (2). From this information, we use Monte Carlo simulation to estimate 
distributions on (D) %T (gray) and the percentage reduction in bycatch mortality 


under MEY (blue), (E) the fraction of cumulative 


MEY that would need to be forgone, or 


(F) the improvement in targeting (“targeting requirement”) needed to halt the 


population's decline. 
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Fig. 3. Trade-offs. For each bycatch population, we compare the median projected rates of 
population change (A) under current conditions (open red circles) and with all target stocks fished 
at the profit-maximizing rate (denoted Fyey) (filled colored circles). Arrows illustrate the effect 

of transitioning to MEY. Bycatch populations whose target stocks are currently fished at lower 
rates than Fyey on average experience greater mortality at MEY (left-facing arrows), and vice 

versa (right-facing arrows). Sizes and colors of filled circles respectively represent median 

required costs (as a percentage of MEY) and median targeting requirements (percentage reduction 
in bycatch mortality, starting from MEY). Gray color indicates that the decline would continue 

even if bycatch were completely eliminated. 
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Fig. 4. Summary. At each cost level (% of MEY, green) and targeting improvement level (%, orange), 
the fraction of populations increasing in abundance is shown. Lines represent means; shaded 
regions represent 95% of states of the world. 
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this principle for catch]. Efficiently ordering 
reductions in fishing pressure among fisheries 
to minimize costs enhances this insensitivity 
(fig. S5). 

Given the data limitations associated with both 
fisheries bycatch (4) and assessing the status of 
target fisheries lacking formal stock assessments 
(2), we urge cautious interpretation of our results 
for any specific bycatch population, some of which 
have a large uncertainty (figs. S2 to S4). Each 
population would benefit from a locally tailored 
follow-up study. However, several broader con- 
clusions are robust to both these uncertainties 
and a wide range of sensitivity analyses (materials 
and methods and figs. S6 to S8). 

First, our results suggest that recovery of ap- 
proximately half of the world’s marine mammals, 
turtles, and birds most threatened by fishery 
bycatch could be achieved as a collateral benefit 
to ending overfishing of target stocks. Given that 
achieving MEY and MSY would respectively re- 
quire 52% and 33% reductions in fishing mortality 
for the median target stock (2), it makes sense 
that this alone could allow many threatened by- 
catch populations to recover. Marine turtles and 
cetaceans in developing-world waters stand to 
benefit in particular (Fig. 3). These populations 
are caught in coastal trawl and gillnet fisheries 
targeting shrimp and finfish (72, 13), which are 
estimated (2) to need the greatest average reduc- 
tions in fishing effort to achieve MEY (Fig. 1B and 
fig. S1). However, MEY reference points for shrimp 
fisheries may need to be refined to account for 
their highly variable, environmentally driven re- 
cruitment (14). 

Second, we project that recovery of some 
bycatch populations would require substantial 
profit losses or targeting improvements. These 
bycatch populations tend to be caught in fisheries 
whose target stocks are already sustainably har- 
vested [e.g., the New Zealand sea lion (Phocarctos 
hookeri)], require total or near-total elimination 
of bycatch to persist (e.g., the vaquita porpoise), 
or both (e.g., the Maui dolphin). Such bycatch 
populations should thus receive high priority in 
efforts to improve fishery targeting. Recent prog- 
ress in bycatch mitigation efforts suggests that 
substantial targeting improvements are achievable 
(15). In many cases, non-fishery-related threats 
to these populations will also need to be addressed. 

Ending overfishing can benefit fisheries and 
fishers. Our results suggest that it can also con- 
tribute substantially to reducing global bycatch 
of threatened species. Of course, ending overfishing 
is not easy. In many places, it will require new 
institutions and infrastructure, combined with 
increases in science and enforcement capacity 
(16). Substantially reducing fishing pressure can 
create short-term hardship for fishing communities 
until stocks recover (2). Rebuilding target stocks 
may also have important—sometimes negative— 
indirect effects on bycatch populations, and vice 
versa [e.g., via competition for prey (77)]. These 
issues deserve attention in future studies. None- 
theless, our conclusions enhance the motivation 
for continued global progress in sustainable fish- 
eries reforms. 
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MALARIA 


GDVI1 induces sexual commitment of 
malaria parasites by antagonizing 
HP1-dependent gene silencing 


Michael Filarsky,”” Sabine A. Fraschka,? Igor Niederwieser,»” 
Nicolas M. B. Brancucci,” Eilidh Carrington,” Elvira Carrié,”? 


Suzette Moes,* Paul Jenoe,* Richard Bartfai, 


3 Till S. Voss’?* 


Malaria is caused by Plasmodium parasites that proliferate in the bloodstream. During 
each replication cycle, some parasites differentiate into gametocytes, the only forms 
able to infect the mosquito vector and transmit malaria. Sexual commitment is triggered 
by activation of AP2-G, the master transcriptional regulator of gametocytogenesis. 
Heterochromatin protein 1 (HP1)—dependent silencing of ap2-g prevents sexual conversion 
in proliferating parasites. In this study, we identified Plasmodium falciparum gametocyte 
development 1 (GDV1) as an upstream activator of sexual commitment. We found that 
GDV1 targeted heterochromatin and triggered HP1 eviction, thus derepressing ap2-g. 
Expression of GDV1 was responsive to environmental triggers of sexual conversion and 
controlled via a gdv1 antisense RNA. Hence, GDV1 appears to act as an effector protein 
that induces sexual differentiation by antagonizing HP1-dependent gene silencing. 


eterochromatin protein 1 (HP1) is aconserved 

regulator of heterochromatin formation, her- 

itable gene silencing, and variegated gene 

expression (1). In Plasmodium falciparum, 

HP1-dependent clonally variant expression 
allows parasites to adapt rapidly to environmental 
challenges encountered during infection (2-4). 
For example, immune evasion via antigenic vari- 
ation of PfEMP1 is the hallmark of Plasmodium 
survival. Other processes, such as expression of 
red blood cell (RBC) invasion ligands or nutri- 
ent transporters, are similarly regulated in this 
parasite (4). Most clonally variant genes cluster 
in subtelomeric domains, but some also occur 
in internal heterochromatic regions of chromo- 
somes. In addition, HP1 forms microdomains at 
some euchromatic genes (2). One of these en- 
codes the transcription factor AP2-G, which is 
required for sexual conversion and differentia- 
tion (2, 5-7). HP1-dependent regulation of ap2-g 
controls the rate at which parasites commit to 
sexual differentiation (7). 

To explore the mechanisms regulating HP1 
occupancy in P. falciparum, we identified HP1- 
interacting proteins by liquid chromatography- 
tandem mass spectrometry (LC-MS/MS) analysis 
of native HP1 complexes that were purified by 
coimmunoprecipitation (co-IP) from parasites ex- 
pressing green fluorescent protein (GFP)-tagged 
HP! (7) (Fig. 1A and table S3). We consistently 
observed GDV1 among the potential HP1 inter- 
action partners (table $1). GDV1 is a nuclear pro- 
tein implicated in sexual commitment and early 
gametocytogenesis, but its exact function remains 
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unknown (8). We therefore created a parasite line 
for the conditional expression of fluorescently 
labeled ectopic GDV1 (designated GDV1-GFP-DD) 
(Fig. 1B). Proteins tagged with the destabiliza- 
tion domain (DD) of the immunophilin protein- 
folding chaperone FK506 binding protein (FKBP) 
are proteolytically degraded unless cells are cul- 
tured in the presence of Shield-1, a small-molecule 
ligand stabilizer (9, 10). Thus, GDV1-GFP-DD is 
barely detectable in parasites cultured in the 
absence of Shield-1 (designated 3D7/GDV1-GFP- 
DDO parasites), but its expression is marked- 
ly induced in parasites grown in the presence of 
Shield-1 (3D7/GDV1-GFP-DD parasites) (Fig. 1, 
B and C). In agreement with the co-IP results, 
GDVI1-GFP-DD colocalized with HP1 at the nu- 
clear periphery (Fig. 1C and fig. S1). Furthermore, 
we found that recombinant HP1 and GDV1 formed 
a complex (fig. S1) and that HP1 copurified with 
GDV1-GFP-DD in reverse co-IPs (Fig. 1D and 
tables S2 and S4). The chromodomain-helicase- 
DNA-binding protein 1 (CHD1) and a protein of 
unknown function (PF3D7_1451200) also con- 
sistently copurified with both HP1 and GDVI1- 
GFP-DD (tables S1 and S2). Given that CHD1 plays 
important roles in cell fate decision and hetero- 
chromatin remodeling in other organisms (JJ, 12) 
and that GDV1 is implicated in gametocytogenesis 
(8), this putative regulatory complex may function 
in activating sexual commitment. 

Malaria parasites proliferate by iterative rounds 
of intraerythrocytic replication through schizog- 
ony, merozoite release, and RBC reinvasion. The 
decision to enter gametocytogenesis is made in 
the cell cycle before sexual differentiation; sexu- 
ally committed schizonts release merozoites that 
invade RBCs and differentiate all into either fe- 
male or male gametocytes (13, 14) (Fig. 2A). To 
test whether GDVI1 triggers sexual commitment, 
we divided samples of 3D7/GDV1-GFP-DD°"* 
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parasites and cultured them in the absence or 
presence of Shield-1. After reinvasion, stage I 
gametocytes were quantified by immunofluores- 
cence assays (IFAs) with antibodies to the game- 
tocyte marker Pfs16 (15) (anti-Pfs16). The 3D7/ 
GDV1-GFP-DD\ population displayed a sexual 
conversion rate of 57.2% (+10.0% SD), compared 
with 11.0% (+2.4% SD) for 3D7/GDV1-GFP-DDO* 
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Fig. 1. GDV1 interacts with HP1. (A) Endogenous 
hp1 locus in 3D7/HP1-GFP parasites and 
Western blots with antibodies to HP1 (anti-HP1 
[a-HP1]) of the anti-HP1-GFP co-IP and negative 
neg.) control samples. Results are representative 
of three biological replicates. a-GFP, antibodies 
to GFP (anti-GFP). (B) GDV1-GFP-DD expression 
plasmid and anti-GFP Western blots of 3D7/ 
GDV1-GFP-DD° and 3D7/GDV1-GFP-DD°N 
parasites. Anti-HP1 served as a loading control. 
C) GDV1-GFP-DD/HPI colocalization IFAs 

with 3D7/GDV1-GFP-DD° and 3D7/GDV1-GFP- 
DDN trophozoites (24 to 32 hpi). DAPI, 

4’ 6-diamidino-2-phenylindole; DIC, differential 
interference contrast; Shld, Shield-1. Scale bars, 
2.5 um (0.5 um for the magnified views in the 
rightmost images). Results are representative of 
three biological replicates. (D) Anti-GFP and 
anti-HP1 Western blots of the anti-GDV1-GFP-DD 
co-IP and negative control samples. Results are 
representative of three biological replicates. 
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parasites, and these gametocytes differentiated 
normally into both male and female gametocytes 
and showed a typical female-biased sex ratio 
(Fig. 2B and fig. S2). Moreover, Shield-1 titration 
revealed a positive correlation between ectopic 
GDV1-GFP-DD expression levels and sexual con- 
version rates (Fig. 2C). To test whether endoge- 
nous GDVI1 expression levels similarly correlate 


Fig. 2. GDV1 induces sexual commitment and 
differentiation. (A) Schematic illustrating the 
iterative cycles of schizogony and RBC reinvasion 
(top) or sexual commitment, RBC reinvasion, 
and gametocyte differentiation (bottom). ER and 
LR, early and late ring stages; T, trophozoites; ES 
and LS, early and late schizonts; gen, generation; 
* time point of anti-Pfs16 (a-Pfs16) IFAs. 

(B) Top panels show results from anti-Pfsl16 
IFAs identifying stage | gametocytes. Quantifica- 
tion of Pfs16-positive parasites is shown at the 
right [results are the means for three biological 
replicates (200 infected RBCs counted per 
sample); error bars indicate SD]. Bottom panels 
show Giemsa-stained blood smears revealing 
stage V gametocytes. Scale bars, 5 um. 

(C) Western blot showing GDV1-GFP-DD expres- 
sion in the presence of increasing Shield-1 
concentrations. Antibodies to glyceraldehyde-3- 
phosphate dehydrogenase (a-GAPDH) and 

HP1 served as loading controls. Percentages of 
Pfsl6-positive parasites are shown at the bottom 
(400 infected RBCs were counted per sample). 
(D) Endogenous gdv1 locus in 3D7/3xHA-GDV1 
parasites and 3xHA-GDV1/HPI1 colocalization 
IFAs in trophozoites (24 to 32 hpi). Scale bar, 
2.5 um. (E) Sexual conversion rates in 3D7/ 
3xHA-GDVI parasites cultured in the presence or 
absence of choline [results are the means for 
three biological replicates (>190 infected RBCs 
counted per sample); error bars indicate SD]. 
(F) Western blot showing 3xHA-GDVI1 expression 
levels in 3D7/3xHA-GDV1 parasites cultured in 
the presence or absence of choline. The arrow- 
head indicates the intact full-length protein. 
Anti-HP1 served as a loading control. Percentages 
of 3xHA-GDV1-positive (3xHA-GDV1-pos.) 
parasites in the presence or absence of choline 
are shown on the right [results are the means 
for three biological replicates (>100 infected 
RBCs counted per sample); error bars indicate 
SD]. (G and H) Comparison of mean expression 
levels for all genes in 3D7/GDV1-GFP-DDON 
parasites versus 3D7/GDV1-GFP-DD° parasites 
(G) and F12/GDV1-GFP-DD®N parasites versus 
F12/GDV1-GFP-DD°F parasites (H). Signifi- 
cantly deregulated genes are indicated by 
circles [mean fold change (FC) cutoff, >1.5; 
q-value (false discovery rate) cutoff, <O.15]. 
Known early gametocyte markers (7, 8, 17) 

are labeled orange. Line graphs show fold 
changes in expression across seven consecu- 
tive time points. pfs16, pfg27, and gexpO2, 
early gametocyte markers (15, 22, 32); pk4 
(PF3D7_0628200) and glu-tRNA-!. 
(PF3D7_1331700), control genes (7). 
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with gametocyte conversion, we used CRISPR- 
Cas9-based gene editing to append a triple he- 
magglutinin (3xHA) tag to the N terminus of 
GDVI1 (yielding 3D7/3xHA-GDVI parasites) (fig. 
$3). Endogenous 3xHA-GDVI colocalized with 
HP1 as expected (Fig. 2D and fig. S3) but was 
expressed only in some parasites. We next quan- 
tified 3xHA-GDV1 expression under conditions 


that either suppress or favor sexual conversion. 
To this end, we made use of the recent discovery 
of choline as an inhibitor of sexual commitment 
(16). 3D7/3xHA-GDVI parasites cultured in the 
presence or absence of 2 mM choline displayed 
sexual commitment rates of 1.8% (+0.3% SD) or 
30.9% (+3.8% SD), respectively (Fig. 2E). Para- 
sites cultured in the absence of choline showed 
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markedly increased 3x HA-GDVI expression levels 
(Fig. 2F). This increase was accounted for by a 
higher proportion of 3x HA-GDV1-positive cells 
[48.6% (+3.4% SD) in the absence of choline com- 
pared with 16.4% (+1.8% SD) in the presence of 
choline] (Fig. 2F) and higher 3x HA-GDV1 expres- 
sion levels in individual 3xHA-GDV1-positive 
parasites (fig. S3). Together, these results show 
that GDV1 activates sexual conversion in a dose- 
dependent manner and that endogenous GDV1 
expression can be induced by environmental 
signals triggering sexual commitment. 

We next performed comparative transcriptome 
analyses using two-color microarrays. 3D7/GDV1- 
GFP-DD™ ring-stage parasites were split and 
cultured separately in the absence or presence 
of Shield-1, and total RNA was harvested at sev- 
en paired time points spanning the remaining 
24 hours of generation 1 [24 to 32 hours post- 
invasion (hpi), 32 to 40 hpi, and 40 to 48 hpi] 
and the first 40 hours after reinvasion in genera- 
tion 2 (8 to 16 hpi, 16 to 24 hpi, 24 to 32 hpi, 
and 32 to 40 hpi) (Fig. 2A). As expected, GDVI1- 
GFP-DD expression triggered a transcriptional 
response characteristic of sexual commitment 
and early differentiation. This effect was evident 
from the induction of ap2-g in generation 1, fol- 
lowed by the activation of early gametocyte mark- 
ers (5, 7, 8, 17) after reinvasion (Fig. 2G, fig. S4, 
and table S5). In the parasite line F12, a 3D7- 
derived gametocyte-deficient clone carrying a 
loss-of-function mutation in ap2-g (5, 18), GDV1- 
GFP-DD expression still activated ap2-g but failed 
to launch a sexual differentiation response (Fig. 
2H and table S6). Next to ap2-g, only eight other 
genes, all of which are marked by HPI, were 
significantly induced in F12/GDV1-GFP-DD°N 
parasites. This set included dblmsp2, which was 
also induced in 3D7/GDV1-GFP-DDON parasites 
(Fig. 2, G and H). Given that DBLMSP2 is a 
merozoite surface antigen expressed only ina 
small subpopulation of schizonts (79, 20), the 
GDV1-dependent activation of the dblmsp2 locus 
suggests that DBLMSP2 may be expressed spe- 
cifically in sexually committed schizonts. In sum- 
mary, these findings show that GDV1 is an upstream 
activator of sexual commitment and likely triggers 
this process by antagonizing HP1-dependent si- 
lencing of ap2-g. 

To test whether GDV1 associates with hetero- 
chromatin in vivo, we conducted comparative 
chromatin immunoprecipitation-sequencing 
(ChIP-seq) experiments. 3D7/GDV1-GFP-DD°™* 
parasites were split at 28 to 34 hpi and cultured 
in parallel in the absence or presence of Shield-1, 
and paired chromatin samples were harvested 2 
(30 to 36 hpi), 6 (34 to 40 hpi), and 10 (38 to 44: hpi) 
hours after Shield-1 addition. We found that 
(i) GDV1-GFP-DD associates specifically with 
heterochromatin throughout the genome (Fig. 3A, 
fig. S5, and table S7), (ii) GDV1-GFP-DD occupancy 
was markedly higher in 3D7/GDV1-GFP-DD©% 
parasites than in 3D7/GDV1-GFP-DD™* parasites 
(fig. S5 and table S7), and (iii) GDV1-GFP-DD oc- 
cupancy was highly correlated with that of HP1 
(Fig. 3B). Moreover, GDV1-GFP-DD occupancy 
peaked 6 hours postinduction and decreased 
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Fig. 3. GDV1 associates with heterochromatin throughout the genome and triggers HP1 
removal at ap2-g. (A) Plots of the ratio of the HP1 level to the input for 3D7/GDV1-GFP-DDOFF 
schizonts [38 to 44 hpi; time point 3 (TP3)] (gray), ChIP-seq subtraction tracks displaying 
relative enrichment of GDV1-GFP-DD in 3D7/GDV1-GFP-DD® schizonts at 2 (30 to 36 hpi: TPL), 
6 (34 to 40 hpi; TP2), and 10 (38 to 44 hpi; TP3) hours after Shield-1 addition (blue), and the 
relative depletion of HP1 in 3D7/GDV1-GFP-DD parasites at TP3 (green). chr12, chromosome 12; 
bp, base pairs. (B) Correlation between GDV1-GFP-DD enrichment in 3D7/GDV1-GFP-DD 
schizonts (34 to 40 hpi; TP2) and HP1 enrichment in 3D7/GDV1-GFP-DD°F schizonts at each 
coding region. r, Pearson correlation coefficient. (C) Fold change in HP1 enrichment upon GDV1-GFP-DD 
overexpression in relation to HP1 enrichment in 3D7/GDV1-GFP-DD°F schizonts for each hetero- 
chromatic gene. (D) Zoom-in view of the enrichment and subtraction tracks at the ap2-g locus. 
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substantially thereafter (Fig. 3A, fig. S5, and table 
87). This drop in GDV1-GFP-DD signal coincided 
with reduced HP1 occupancy over heterochro- 
matic genes in 3D7/GDV1-GFP-DD parasites 
compared with that in 3D7/GDV1-GFP-DD°** 
parasites (Fig. 3, A and C, and table S7). Although 
the vast majority of heterochromatic loci, in par- 
ticular those displaying high HP1 occupancy, such 
as the var genes encoding PfEMP1, displayed only 
slightly decreased HP1 levels, some genes exhib- 
ited as much as 40% reduction in HP1 occupan- 
cy (Fig. 3C and table $7). This group of genes 
included ap2-g and most known HP1-associated 
early gametocyte markers, including geco (21), 
pfgexpl7 (22), and pfe14_748 (8, 17) (Fig. 3, C and 
D, and table S7). These data are consistent with 
the microarray results, where GDV1-GFP-DD ex- 
pression activated ap2-g and early gametocyte 
genes but had no effect on the expression of the 
bulk of heterochromatic loci, including var genes 
(Fig. 2, G and H). Given the 50 to 60% sexual con- 
version rate observed for 3D7/GDV1-GFP-DD©% 
parasites (see above), a 30 to 40% reduction in 
HP1 occupancy indicates that HP1 may be de- 
pleted at these loci specifically in sexually com- 
mitted parasites, but single-cell approaches are 
required to confirm this hypothesis. Overall, we 
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suggest that GDV1 destabilizes heterochromatin 
and thus allows specific transcription factors to ac- 
tivate expression of ap2-g and other gametocyte- 
specific heterochromatic genes, and this process 
may play an important role in the positive auto- 
regulatory feedback loop proposed to reinforce 
AP2-G expression in committed parasites (5, 6, 23). 
How GDV1 achieves specificity in unlocking spe- 
cific HP1-associated genes despite binding hetero- 
chromatin genome-wide is a challenging question 
to be addressed in the future. 

Because GDVI activates sexual commitment, 
the question arises of how parasites limit GDV1 
expression to prevent sexual conversion in asexual 
schizonts. A recent study identified a multiexon- 
long noncoding gdv/ antisense RNA (asRNA) that 
initiates downstream of the gdv/ locus and over- 
laps with the ATG start codon of gdvI (24), which 
is a hallmark feature of regulatory asRNAs (25). To 
investigate whether the gdvI asRNA participates 
in regulating sexual commitment, we created a 
gdvI asRNA loss-of-function [asRNA knockout 
(asKO)] mutation in F12 parasites (yielding F12/ 
gdvI-asKO parasites) (Fig. 4A and fig. S6). Strand- 
specific RNA sequencing (RNA-seq) analysis iden- 
tified a small set of genes that were consistently 


differentially expressed between F12/gdvI-asKO 
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and F12 wild-type parasites (17 genes were up- 
regulated and 23 genes were down-regulated 
in the mutants) (Fig. 4B and table S8). Similar to 
F12 parasites expressing ectopic GDV1-GFP-DD 
(Fig. 2H), F12/gdvJ-asKO parasites showed marked 
induction of ap2-g, dblmsp2, and two early game- 
tocyte genes (pfgl4_748 and PF3D7_1477400) 
(8, 17), and all except one up-regulated gene are 
HP1-associated genes (Fig. 4B, fig. S6, and table S8). 
gdvI sense transcripts were slightly increased in 
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the F12/gdv1-asKO population, and gdvI anti- 
sense transcripts were undetectable, as expected 
(Fig. 4, B and C; fig. S6; and table S8). These 
results indicated that the gdvI asRNA acts as 
a negative regulator of GDV1 expression. To 
confirm this hypothesis, we tagged endogenous 
GDVI1 with HA in these parasites (yielding an 
F12/3xHA-GDV1/gdv1-asKO population) and ob- 
served that indeed almost all parasites (96.7% + 
2.5% SD) expressed 3xHA-GDVI1 (Fig. 4D and 
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fig. S7). Lastly, we showed that deletion of the 
gdvI asRNA locus in a conditional AP2-G mutant 
resulted in markedly increased production of ga- 
metocytes (fig. S8 and supplementary text). To- 
gether, these findings demonstrate a central role 
for the gdui asRNA in regulating GDV1-dependent 
activation of sexual commitment. We anticipate 
that this mechanism likely involves inhibiting 
GDV1 expression by interference with gdv]J MRNA 
transcription, stability, or translation, similar to 
asRNA-mediated gene regulation in other or- 
ganisms (26). 

We identified GDV1-mediated heterochromatin 
destabilization as an epigenetic control strategy 
regulating sexual cell fate decision in P. falciparum. 
Our discovery of the gdvJ asRNA as a negative 
regulator of sexual commitment is reminiscent 
of long noncoding RNA-mediated control of ga- 
metogenesis in yeasts (27, 28). In Saccharomyces 
cerevisiae, nutritional stress triggers gameto- 
genesis by activating the transcriptional regulator 
inducer of meiosis 1 (IME1) (28). A long noncod- 
ing RNA in the ime promoter and antisense tran- 
scription of 7me4 are key factors in preventing 
IME] expression under noninducing conditions 
(29, 30). These parallels raise the exciting pos- 
sibility that evolutionarily divergent unicellular 
eukaryotes employ conceptually similar regula- 
tory logic to control entry into the sexual phases 
of their life cycles. All Plasmodium species infect- 
ing humans possess a GDV1 ortholog, suggesting 
that GDV1-based regulation of sexual commit- 
ment is conserved in all human-infective malaria 
parasites. In conclusion, our study contributes to 
understanding of the molecular pathway under- 
lying the formation of malaria transmission stages 
and provides opportunities for the development 
of intervention strategies targeting transmission 
of human malaria. 
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Precursors of logical reasoning 
in preverbal human infants 


Nicol6é Cesana-Arlotti,’”* Ana Martin,’* Erné Téglas,” Liza Vorobyova,”” 


Ryszard Cetnarski,*? Luca L. Bonatti** 


Infants are able to entertain hypotheses about complex events and to modify them rationally 
when faced with inconsistent evidence. These capacities suggest that infants can use 
elementary logical representations to frame and prune hypotheses. By presenting scenes 
containing ambiguities about the identity of an object, here we show that 12- and 19-month-old 
infants look longer at outcomes that are inconsistent with a logical inference necessary to 
resolve such ambiguities. At the moment of a potential deduction, infants’ pupils dilated, 
and their eyes moved toward the ambiguous object when inferences could be computed, 

in contrast to transparent scenes not requiring inferences to identify the object. These 
oculomotor markers resembled those of adults inspecting similar scenes, suggesting that 
intuitive and stable logical structures involved in the interpretation of dynamic scenes may 


be part of the fabric of the human mind. 


ifty years ago, Piaget argued that logic in 

the mind is the culmination of a long devel- 

opmental process, extending into adoles- 

cence. Forty years ago, Fodor answered that 

if learning implies testing hypotheses, then 
learners must possess the representational re- 
sources to formulate them, including logical prim- 
itives: rule-like combinatorial concepts embedded 
in a compositional system of representation, or a 
language of thought (J). 

After four decades, we still lack insight into the 
nature and development of the logical represen- 
tations, if any, that structure infants’ thinking 
and problem-solving. Partly, this profound lack 
of knowledge stems from the widespread belief 
that infant cognition relies on independent mod- 
ules, functioning early and efficiently, but not 
supported or connected by general reasoning 
(2). Partly, it stems from the assumption that al- 
though logical representations are involved in 
processing language, and hence are present in 
organisms that master a natural language, it is 
difficult, perhaps impossible (3), to identify them 
in nonverbal organisms (4). Acquiring language 
does improve cognition, perhaps also by creat- 
ing novel logical representations (5). However, 
none of these considerations weakens the real 
force of Fodor’s argument, although its premises 
need to be reappraised. Although infants possess 
learning mechanisms that do not require hypothe- 
ses [e.g., bottom-up tracking of statistical regu- 
larities (6)], flexible and productive hypothesis 
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testing does begin in infancy, with a vengeance. 
Infants can generate hypotheses about uncertain 
future events (7), flexibly adapting them to novel, 
albeit subtle, elements of a situation (8, 9). They 
measure the evidence in support (0) and test 
alternative hypotheses when violations occur 
(11, 12). Such abilities extend far beyond precom- 
piled mechanisms for domain-specific responses, 
demonstrating a high degree of rationality in sev- 
eral domains. One prominent account of them 
depicts infants as precocious Bayesian reasoners. 
However, most Bayesian theories require a log- 
ical scaffolding to formulate, test, and modify 
hypotheses (8, 13-15). Thus, characterizing the 
basic logical representations available to pre- 
verbal infants for formulating hypotheses re- 
mains fundamental to understanding the very 
nature of knowledge acquisition (16). 

Here we begin investigating the developmen- 
tal precursors of such scaffolding, looking for 
behavioral correlates of one simple logical repre- 
sentation and rule: disjunction (either A or B) and 
disjunctive syllogism (not A, therefore B). Although 
elementary, this schema grounds one crucial 
hypothesis-testing strategy: Sherlock Holmes- 
like case-by-case analysis of different possibilities, 
excluding alternatives until the culprit is found. 
Attempts to find clear evidence of disjunctive 
syllogism in nonhuman animals have so far been 
inconclusive (4, 17). A related reasoning pattern 
has been studied in toddlers’ and preschoolers’ 
word-learning strategies (78, 19), but it is un- 
known whether it is within the conceptual rep- 
ertoire of preverbal infants. We first investigate 
whether infants can frame disjunctive hypothe- 
ses and make inferences by logically eliminating 
alternatives, testing their reactions to outcomes 
that violate conclusions of this deductive process. 
Then, we identify markers of inferential activity 
by examining the dynamics of oculomotor re- 
sponses during inference making. Last, we ex- 
plore stability across development by comparing 
the oculomotor responses of infants, toddlers, 
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and adults passively looking at nonverbal scenes 
that potentially involve logical inferences. 

We studied 12- and 19-month-old infants, two 
ages at the onset of speech production and lan- 
guage learning but that precede the development 
of extensive language knowledge. We presented 
infants with scenes injected with ambiguity 
about the identity of an object, which could be 
resolved through disjunctive syllogism. In exper- 
iments 1 and 2, two objects different in shape, 
texture, color, and category, but with identical 
top parts (say, a dinosaur and a flower), enter a 
virtual theater (Fig. 1 and fig. S1). An occluder 
hides them, and a cup scoops one of them from 
behind it, with only the top part visible. Thus, 
infants cannot know the identity of the scooped 
object and may establish a disjunctive represen- 
tation. Then, the occluder moves downward, re- 
vealing one object—say, the dinosaur. We call 
this moment the “potential deduction phase,” in 
which infants have evidence to disambiguate the 
identity of the scooped object by disjunctive syl- 
logism. Last, in the “outcome phase,” the dinosaur 
leaves the stage, and the cup reveals the second 
object. Half of the time, the revealed object is 
consistent with the conclusion suggested by the 
logical inference (it is the flower), whereas the 
other half, it is inconsistent (it is the dinosaur). 
We recorded looking time during the outcome 
phase in a violation of expectation (VOE) para- 
digm. Both 12- and 19-month-olds looked longer 
at the inconsistent outcome, suggesting that they 
may have derived the identity of the object in 
the cup through logical inference and were sur- 
prised when this conclusion was violated, as re- 
vealed by mean looking times, M [experiment 1, 
19-month-olds (n = 24), Meonsistent = 7-7 8, 
Minconsistent = 10.5 8, F,23 = 5.79, P = 0.025; exper- 
iment 2, 12-month-olds (n = 24), Meonsistent = 6.2 S, 
Minconsistent = 7-6 8, F,3 = 5.19, P = 0.032; repeated 
measures analyses of variance (ANOVAs)]. 

In experiments 1 and 2, in an inconsistent 
outcome, an object appears twice successively: 
The occluder lowers, revealing the dinosaur, 
which exits the stage, and then the cup reveals 
a dinosaur again (Fig. 1A, vi-2). Conversely, in a 
consistent outcome, two different objects appear 
successively: After the dinosaur exits, the cup 
reveals the flower. Thus, the infants may have 
reacted not to a logical inconsistency but to the 
surface aspects of the final sequence, when the 
same object appeared twice in succession (Fig. 1A, 
vi-1). In experiments 3 and 4, the logical status 
of the final object sequence reverses (Fig. 2A). 
The movies are identical to those of experi- 
ments 1 and 2 until the potential deduction 
phase. There, the occluder never lowers; one 
object (e.g., a snake) exits from its side, remain- 
ing visible for about 1.5 s, then returns behind 
it. In the outcome phase, the cup never reveals 
its content. Instead, another object exits the 
occluder: sometimes the snake again, and some- 
times the other object in the pair (in this example, 
a ball; Fig. 2A, vi). The former outcome is con- 
sistent with the logical inference; however, un- 
like experiments 1 and 2, one single object is 
seen twice in succession. The latter outcome is 
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Potential 
Deduction 
phase 


Consistent Inconsistent 


Looking time (s) 


19 Mos 


12 Mos 


Fig. 1. Infants look longer at outcomes that 
are inconsistent with a logical deduction. 

(A) (i) Two objects with an identical upper part 
enter the theater. (ii and iii) An occluder hides them, 
and a cup scoops one from behind, only the top 
part of which is visible (dinosaur or flower?). (iv and 
v) The occluder lowers, allowing the observer to infer 
the identity of the object in the cup (not dinosaur, 
therefore flower). (vi) The cup reveals its content, 
which is either consistent (flower) or inconsistent 
(dinosaur) with the inference. (B) Mean (+ SEM) 
time spent looking at the outcomes (in seconds). 
Both 19- and 12-month-olds looked longer at the 
inconsistent outcome. *P < 0.05. Mos, months. 
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phase 
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inconsistent, but two different objects are seen 
once in succession. If infants respond to the surface 
aspects of the final sequence, they should disregard 
the logical consistency of the outcome and look 
longer when the single object appears twice, as 
in experiments 1 and 2. If, instead, their behavior 
is guided by a logical inference, they should look 
longer when the outcome is inconsistent with it 
even if the final sequence reverses that of exper- 
iments 1 and 2. Both 19 and 12-month-olds looked 
longer at the inconsistent outcome (Fig. 2C), 
suggesting that they reacted to the logical gist of 
a scene [experiment 3, 19-month-olds (n = 24), 
Meonsistent = 4-9 8, Minconsistent = 6.2 S, Fi23 = 8.5, 
P = 0.008; experiment 4, 12-month-olds (n = 
24), Meonsistent = 4-2 8, Minconsistent = 6-18, F; 123'= i, 
P = 0.003; repeated measures ANOVAs]. These 
results also control for other nonlogical explana- 
tions, such as an object’s magical disappearance 
in the inconsistent outcome of experiments 1 and 
2 (no such disappearance occurred in experi- 
ments 3 and 4) or its greater featural variability 


16 March 2018 


Cc 12 

~ 12 Mos 

ace 

E 8 kk * 
Zz 4 

[o) 

fo) 

ei 


Inference 


No-Inference 


KKK 


12 
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Fig. 2. Infants’ logical reasoning does not 
depend on how the scene is physically 
realized. (A) (i to iii) Inference condition: 
The identity of the object in the cup cannot 
be determined before the potential 
deduction phase. (B) (i to iii) No-inference 
condition: The cup scoops the object in 

full view, so its identity is known. (iv and 

v) Potential deduction phase [common to 
(A) and (B)]: Only in the inference condition 
is a deduction needed to determine the cup 
content. Note that the physical realization 
of this phase is very different from that of 
experiments 1 and 2. (vi-l and vi-2) Outcome 
phase [common to (A) and (B)]: An object 
exits the occluder, yielding a consistent 

or inconsistent outcome. (C) Mean (+ SEM) 
time spent looking at the outcomes (in 
seconds). Both 12- and 19-month-olds 
looked longer at the inconsistent outcome. 
*P < 0.05; **P < 0.01; ***P < 0.001. 


Inference 


in experiments 3 and 4 (reversed in experi- 
ments 1 and 2). 

VOE only measures a response post hoc, after 
a conclusion has been reached (20). Adults reason- 
ing with language make disjunctive inferences as 
early as they have the relevant evidence (21, 22). 
The data reported so far do not characterize the 
unfolding of an inference in the infant mind. To 
explore this, we analyzed oculomotor responses 
during the potential deduction phase. We created 
novel scenes identical to experiments 3 and 4 in 
the potential deduction and outcome phases, but 
requiring no inference to identify the object in 
the cup (experiments 5 and 6; Fig. 2B). We did 
this by showing the cup scooping one object in 
full view before occlusion. Thus, unlike experi- 
ments 3 and 4, in experiments 5 and 6, infants al- 
ready know which object is in the cup before the 
potential deduction phase. As expected at these 
ages (23), infants looked longer at an outcome 
inconsistent with the identity of the (known) 
object in the cup [experiment 5, 19-month-olds 
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Fig. 3. Pupil dilation and eye position 
during the potential deduction phase may 
signal online logical inferences. (A) Sub- 
components of the potential deduction phase, 
time-adjusted to participants’ ages and color- 
coded in indigo shades. (iv-a) An object 

exits the occluder. (iv-b) It stops by the cup 
containing a second object whose identity 

is either known (no-inference condition) 

or ambiguous (inference condition). In the 
inference condition, this is the first moment in 
which participants have evidence available 
that disambiguates the cup content. (v-a) 

The visible object returns behind the occluder. 
v-b) The object remains hidden inside the 
cup, onstage. (B to D) Temporal course of 
pupil dilation changes (in millimeters) from 
baseline and mean x gaze positions for 

B) 12-month-olds, (C) 19-month-olds, and 
D) adults in the inference (blue) or no- 
inference (red) condition. Conditions were 
distributed between participants for infants 
and within participants for adults. Data are 
plotted starting when the mean x coordinates 
of the two conditions converged on the 
object emerging from the occluder. Error 
bars are SEM for infants and 95% within- 
participants confidence intervals for adults. 
Yellow bars indicate regions of differences 

in the two conditions (cluster-based permuta- 
tion tests; supplementary materials). 
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ages, participants who had to perform a logical inference to identify the cup content 
had pupils that were more dilated and gazed more toward the cup. 


(n a 24), Meonsistent = 3-8 8, Minconsistent = 8.3 Ss, 
F,,93 = 26.1, P = 0.0001; experiment 6, 12-month- 
olds (n =24), Moonsistent = 4-9 8, Minconsistent = 6.2 8, 
F,93 = 4.9, P = 0.037; repeated measures ANOVAS]. 
But our focus here is the temporal course of oc- 
ulomotor responses during the potential deduc- 
tion phase, which we expected to be modulated 
by the need for an inference. Cluster-based per- 
mutation tests (24, 25) revealed that at several 
points during the potential deduction phase, 
the infants’ pupils dilated more when the scene 
licensed an inference than when it did not, sug- 
gesting increased cognitive activity possibly due 
to inference-making. By the end of this phase, 
infants also displaced their eyes toward the cup 
more markedly (Fig. 3, B and C, and supplemen- 
tary materials) and switched their focus from the 
visible object to the cup in more trials (Minference = 
71%, Mao inference = 50%, Fygg = 10.4, P = 0.002, 
two-way ANOVA; fig. S2 and supplementary 
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materials) when a deduction was needed than 
when it was not. 

Only when the potential deduction phase af- 
forded an inference did higher pupil dilation and 
visible object-to-cup shifts contribute to predict- 
ing success at identifying inconsistencies in the 
later outcome phase. No such predictive relation 
occurred absent the need for an inference (fig. S3 
and supplementary materials). The fact that this 
relation occurred only when an inference may 
have been involved suggests that oculomotor 
markers in the potential deduction phase are not 
simply due to memory of past event structures but 
are tied to some kind of mental inference about 
the identity of the object in the cup, drawn soon 
after infants acquire the disambiguating evidence. 

Thus, these oculomotor markers suggest that 
preverbal infants efficiently deploy logical proce- 
dures to process the components of an unfolding 
scene. To assess developmental stability, we in- 
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spected adults’ oculomotor responses during the 
same potential deduction phase. Adults (experi- 
ment 7, n = 30) saw 96 scenes patterned upon 
those of experiments 1 to 6. Like infants, during the 
potential deduction phase, adults’ pupils dilated 
more, and their eyes tended to look more toward 
the cup, when the scene licensed an inference. Again, 
this occurred regardless of its physical realization. 
Adults and infants differed only in the speed of 
such markers, but not qualitatively (Fig. 3D, figs. 
S4 and S5, and supplementary materials). 

Our data document the early presence of 
primitive logical abilities. Without instructions or 
tasks, infants spontaneously reason logically while 
ascene unfolds. Specific behavioral markers can 
be used to study the precise temporal course of 
their reasoning process. Because such markers 
already appear at ages when language develop- 
ment has barely begun, our data suggest that 
precursors of logical reasoning are independent 
of language acquisition. Their stability across 
ages and spontaneous deployment suggest that 
some form of elementary logical reasoning may 
be a primitive property of the logical circuitry in 
the human brain (26). Explaining our data with- 
out invoking deductive inferences has a cost. 
Bayesian iterative models, which evaluate the 
most likely of the alternatives first and cycle 
through them when the first choice is discarded, 
could mimic deductive syllogism without assum- 
ing a logical inference in the potential deduction 
phase. However, they require that infants repre- 
sent the space of alternatives (which is equivalent 
to implementing a disjunctive representation), 
assign ordered priors to the alternatives, and as- 
sess alternative evaluations iteratively. A logical 
inference requires fewer assumptions. Thus, al- 
though not incompatible with Bayesian reason- 
ing, the hypothesis that infants perform a logical 
inference in the potential deduction phase is a 
more parsimonious explanation of our results. 

In spite of the stability that we document, 
children begin mastering a verbally expressed 
disjunction late in development (17). However, 
a dissociation between spontaneous inferential 
abilities in nonverbal contexts and their explicit 
verbal counterparts need not imply the lack of a 
concept. Instead, it indicates that mapping the 
spontaneous logical structures of thought onto 
their verbal counterparts is an extremely intri- 
cate process. A deceivingly simple word such as 
“or” has a very complex semantics (27). Unam- 
biguous evidence for its meaning is hard to 
come by, a difficulty that affects the acquisition 
of even simpler abstract words (28). Thus, a con- 
sequence of our research is that much work is 
still needed to understand how the proper align- 
ment between language and thought occurs. 

This empirical evidence is directly relevant to 
the old, yet still fundamental questions debated 
between Fodor and Piaget. Logical representa- 
tions that are crucial components of infants’ nat- 
ural hypothesis-testing attitude are available when 
infants start projecting and testing hypotheses 
about the world. Such representations may con- 
sist of nonlinguistic but fully language-like struc- 
tures, or they may piggyback on sophisticated 
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object representations that can track object iden- 
tities in ambiguous situations. Although fur- 
ther research is needed to clarify their nature, 
our data suggest that intuitive and stable logical 
structures involved in the interpretation of dy- 
namic scenes may be essential parts of the fabric 
of the mind. This does not imply that all logical 
reasoning is spontaneous or innate, just as spon- 
taneous and innate elementary numerical abili- 
ties do not imply that all mathematical knowledge 
is innate. Reasoning occurs in many different 
forms and at many different levels of our mental 
processes, and the gulf separating infant think- 
ing from adult explicit logical reasoning is large. 
However, the development of reasoning abilities 
builds on a natural logical foundation, whose 
profile we are beginning to uncover. 
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NITROGEN CYCLE 


Factoring stream turbulence 
into global assessments of 


nitrogen pollution 


Stanley B. Grant,)?* Morvarid Azizian,” Perran Cook,’ Fulvio Boano,* Megan A. Rippy' 


The discharge of excess nitrogen to streams and rivers poses an existential threat to 

both humans and ecosystems. A seminal study of headwater streams across the United States 
concluded that in-stream removal of nitrate is controlled primarily by stream chemistry and 
biology. Reanalysis of these data reveals that stream turbulence (in particular, turbulent mass 
transfer across the concentration boundary layer) imposes a previously unrecognized upper 
limit on the rate at which nitrate is removed from streams. The upper limit closely approximates 
measured nitrate removal rates in streams with low concentrations of this pollutant, a 
discovery that should inform stream restoration designs and efforts to assess the effects of 
nitrogen pollution on receiving water quality and the global nitrogen cycle. 


ver the past century, humans have sub- 
stantially increased nitrogen loading to 
streams and rivers, primarily from the over- 
application of fertilizer for food production. 

The environmental consequences of this 
nitrogen pollution are evident in both developed 
and developing countries and include eutrophica- 
tion of inland and coastal waters, ocean acidifi- 
cation, and greenhouse gas generation (J-3). 
Thousands of stream, river, lake, groundwater, 
and coastal sites across the United States are 
classified as impaired for nitrogen by the U.S. 
Environmental Protection Agency (4). In a re- 
cent assessment of critical Earth systems required 
for the continued development of human societies, 
nitrogen pollution was identified as one of only 
three planetary boundaries (along with phospho- 
rous pollution and loss of genetic diversity) that 
have already been crossed (5). According to the 
US. National Academy of Engineering, restoring 
balance to the nitrogen cycle is one of the 14 “Grand 
Challenges” facing engineers in the 21st century (6). 
Streams have a natural capacity to remove 
dissolved inorganic nitrogen (DIN, which in- 
cludes nitrate, nitrite, and ammonium) through 
a coupling of physical transport processes and 
biologically mediated reactions in streambed 
sediments (Fig. 1A). DIN is assimilated by auto- 
trophs growing at the sediment-water interface 
(benthic algal layer) and heterotrophic microbial 
populations in the hyporheic zone (7), a region 
of the streambed where hydrologic flow paths 
begin and end in the stream (8). As DIN travels 
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through the hyporheic zone, it undergoes a 
variety of microbially mediated redox reactions, 
including oxidation of ammonium to nitrate (ni- 
trification) and reduction of nitrate to nitrite, 
nitrous oxide, and dinitrogen (denitrification). Of 
these reactions, only denitrification permanently 
removes nitrogen from the stream through the 
evasion of nitrous oxide or dinitrogen gas. The 
production of nitrous oxide by streams is re- 
sponsible for ~10% of global anthropogenic 
emissions of this potent greenhouse gas (9), of 
which headwater streams may account for a 
disproportionate fraction (2). Of the DIN that 
is assimilated, a fraction is stored (for >1 year) 
as particulate nitrogen in streambed sediments 
or in adjacent riparian vegetation (10), whereas 
the rest is remineralized and released back to 
the stream. 

The local efficiency with which DIN is removed 
from a stream can be quantified by one of several 
nutrient-spiraling metrics (77). In our study, 
we focused on nitrate (because of its mobility, 
recalcitrance, and environmental effects) and 
quantified its removal with the nitrate uptake 
velocity v; = 0 (units of meters per second), de- 
fined as the flux of nitrate into the streambed 
divided by the concentration of nitrate in the 
overlying water column. 

The second Lotic Intersite Nitrogen eXper- 
iment (LINX II), which was conducted over 
5 years from 2001 to 2006, remains one of the 
most comprehensive studies of nitrate uptake 
in headwater streams to date (7, 9, 12, 13). LINX 
II included N-labeled nitrate addition experi- 
ments in 72 streams across eight regions of the 
United States, collectively representing eight 
different biomes (temperate rain forest, chap- 
arral, northern mixed forest, deciduous forest, 
montane coniferous forest, temperate grassland, 
shrub desert, and tropical forest) and three dif- 
ferent land-use types (reference streams, urban 
streams, and agriculture streams). On the basis 
of regression and structural equation modeling 
of these data, LINX II researchers concluded that 
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the nitrate uptake velocity is controlled primarily 
by stream chemistry (ambient concentrations of 
nitrate and ammonium) and biology (gross pri- 
mary production and ecosystem respiration) and 
only weakly by stream physics (residence time in 
the hyporheic zone). 

Evaluations of physical controls on nitrate 
uptake in streams have focused on hyporheic ex- 
change (circulation of water through the hypo- 
rheic zone) quantified on the basis of transient 
storage analysis of conservative tracer injection 
experiments (14) or physical models of water 
pumping through streambed sediments by static 
and dynamic pressure variations (2, 8, 15). Missing 
from these previous assessments is turbulent 
mass transport across the concentration boundary 
layer (CBL) above the streambed. This transport 
mechanism is a key control on the delivery of 
oxygen to fine-grained (nonpermeable) sedi- 
ments (16), although its role in mass transfer to 
coarser (permeable) sediments (like most of the 
headwater streams included in the LINX II 
study) is not clear (17). 

Given the CBL’s position between the stream 
and streambed (Fig. 1A), we hypothesized that 
nitrate uptake by permeable streambeds might 
be “bottlenecked” by turbulent transport across 
the CBL. In that event, the uptake velocity can be 
expressed as the product of a mass transfer co- 
efficient X,, that depends solely on stream physics 
(the velocity with which mass is “squeezed” across 
the CBL by turbulence, units of meters per sec- 
ond) and an efficiency o that captures the 
coupled hydrogeology and biogeochemistry of 
nitrate uptake in the benthic algal layer and 
hyporheic zone (the fraction of nitrate delivered 
to the streambed that is removed by assimilation 
and denitrification, unitless) (18) 


Up = Ok, Vp >0, OS aK<1,k p20 (1A) 
a=1 : O<sy<oa (1B) 
eae 

Obed 
= 1C 
v3 (1C) 


Conceptually, the mass transfer coefficient k,, 
represents the potential (mass transfer-limited) 
uptake velocity of a stream, whereas the efficiency 
a indicates the fraction of that potential realized 
in practice. The efficiency depends on a dimen- 
sionless number y, which represents the ba- 
lance of nitrate uptake in the streambed (Up eq, 
units of meters per second) and turbulent mass 
transfer across the CBL. Because efficiency o 
varies from 0 (y—0) to 1 (y—0), if our hypoth- 
esis is correct the uptake velocity should always 
be less than or equal to the mass transfer co- 
efficient: vu; < ky, (See Eq. 1A). 

As atest of our hypothesis, we estimated values 
of the mass transfer coefficient at all LINX II sites 
where uptake velocities were reported for both as- 
similation and denitrification [total uptake (tot), 
units of meters per second] and denitrification 
alone [denitrification uptake (ven), units of meters 
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per second] (69 and 49 of the 72 LINX II sites, 
respectively) (7, 12, 13). Site-specific values of the 
transfer coefficient k,, were estimated from sur- 
face renewal theory, which assumes that mass 
transport across the CBL occurs by sweep and 
ejection events associated with coherent turbu- 
lence in the stream, together with molecular dif- 
fusion of mass into the streambed (19). This theory 
predicts that #,, can be calculated from routinely 
measured features of a stream, including slope 
(S, unitless) and depth (A, units of meters), to- 
gether with temperature-corrected values for the 
kinematic viscosity of water (v, units of square 
meters per second) and the molecular diffusion 
coefficient of nitrate in water (D,,, units of square 
meters per second) 


km = 0.17U,Se 2/3 (2A) 
Se = v/Dm (2B) 
Ux = VghS (2C) 


The Schmidt number (Sc, unitless) represents 
the relative importance of molecular diffusion 
of momentum and mass, the shear velocity (u,., 
units of meters per second) is a measure of 
stream turbulence, and g = 9.81 m s~ is the ac- 
celeration of gravity. Very similar formulae for 
calculating the mass transfer coefficient (Eq. 2A) 
are obtained for different conceptual models of 
the sediment-water interface (e.g., rough versus 
smooth) [reviewed in (17)]. 


Fig. 1. Stream turbulence imposes an upper 
limit on nitrate uptake by assimilation and 
denitrification. (A) Conceptual model of how 
nitrate is transported from the bulk stream, 
across the concentration boundary layer, and 
into the streambed where it is assimilated and 
denitrified in the benthic algal layer and hypo- 
rheic zone. (B) Total uptake velocities 
(accounting for both nitrate assimilation and 
denitrification) measured during the LINX II 
field campaign, plotted against mass transfer 
coefficients calculated from Eq. 2A. Colors 
denote surrounding land use [reference (REF), 
agriculture (AGR), or urban (URB)]. (©) Same as 
(B), except denitrification uptake velocities are 
plotted on the vertical axis. (D) Empirical 
cumulative distributions of total (solid curves) 
and denitrification (dashed curves) efficiencies 
by land-use type. Efficiencies were calculated 
from the ratio of measured uptake velocities and 
site-specific values of the mass transfer 
coefficient calculated from Eq. 2A. 
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With few exceptions and consistent with our 
hypothesis, the LINX II total and denitrification 
uptake velocities conform to the inequality v; < 
ky, (Fig. 1, B and C). The implied removal effi- 
ciencies (computed from the ratio o = v;/k,,) span 
approximately three (10~* < Oden < 0.1) and four 
(10°* < dicot < 1) orders of magnitude for de- 
nitrification and total uptake, respectively (Fig. 
1D). The reduced range for oge, probably reflects 
the restrictive nature of denitrification, which re- 
quires nitrate to be transported into the streambed 
(e.g., by hyporheic exchange) and the presence 
of anoxic conditions and organic carbon, both 
of which may be rate-limiting in some streams 
(10, 12, 13). For the few sites that do not conform 
to the inequality v; < k,,, the total uptake velocity 
exceeds the mass transfer coefficient by factor of 
2 or less, well within the uncertainty of the methods 
used to estimate the mass transfer coefficients (77) 
and uptake velocities (72). 

Removal efficiencies calculated from the LINX 
II data do not exhibit a consistent relationship to 
catchment land use (Fig. 1D), but they are neg- 
atively correlated with stream nitrate concentra- 
tion (Fig. 2). In one of the most notable findings 
to come out of the LINX II study, a similar 
negative correlation was observed between nitrate 
uptake velocity and stream nitrate concentration 
(12). As noted by Mulholland e¢ al., increasing 
nitrate load to a stream could therefore reduce 
the nitrate uptake velocity and elicit “a dis- 
proportionate increase in the fraction of nitrate 
that is exported to receiving waters” (72). Our 
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hypothesis provides a mechanistic explanation 
for this key LINX II finding: Uptake velocities 
are highest in streams with low nitrate concen- 
tration because, under such conditions, all nitrate 
transported to the streambed by turbulence is 
removed by assimilation and denitrification 
(tot * 1 when [NO] < 10°? mol m”) (Fig. 2A), 
and the nitrate uptake velocity is limited by 
mass transfer from the stream to the streambed 
(Ugtot * Km). With increasing nitrate concentra- 
tion, a smaller fraction of nitrate transported to 
the streambed is removed (o;,_ declines with in- 
creasing [NO; ], presumably because sediment- 
associated autotrophic and heterotrophic organisms 
are progressively growth-limited by something other 
than nitrate) and nitrate uptake in the stream- 
bed is inefficient (Ugtot << Am). Denitrification 
efficiencies O.ge, calculated from the LINX II data 
set follow a similar trend (compare panels A and 
B in Fig. 2). Across all stream sites sampled in the 
LINX II study, the denitrification efficiency is a 
roughly constant fraction of the total efficiency 
(Gaen od 0.14.01) (20). 

Our hypothesis also implies a simple scaling 
relationship for the fraction of nitrate removed 
(O < f< 1) over a stream reach of length L (units 
of meters) (27) 


f=l o0| ora (=) sen (3) 


If the goal is to enhance potential nitrate re- 
moval by manipulating stream physics (e.g., 
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Fig. 2. The turbulence-imposed upper limit 
on nitrate uptake is observed in streams 
with low nitrate concentration. (A) The frac- 
tion of nitrate removed in the streambed by 
assimilation and denitrification is negatively 
correlated with stream nitrate concentration 
(coefficient of determination r* = 0.41, P < 0.01) 
and approaches 100% (atot = 1) when nitrate 
concentrations are low ([NO3_] < 10°? mol m’°). 
(B) The fraction of nitrate removed in the 
streambed by denitrification is also negatively 
correlated with stream nitrate concentration (r* = 
0.32, P < 0.01). Lines represent least-squares 
linear regressions of log-transformed efficiency 
against log-transformed nitrate concentration: 
logioa = a + blogio[NO3 J], where the constants 
are a = -2.5+ 0.18 and b = -0.49 + 0.07 for ott 
and a = -3.36 + 0.22 and b = -0.49 + 0.11 for Oden. 


through stream restoration), Eq. 3 indicates that 
the Darcy-Weisbach friction factor fp = 8u?/U? 
[where U (units of meters per second) is the 
average velocity of the stream] and the length- 
to-depth ratio L/h should be maximized, for in- 
stance, using conventional hydraulic relationships 
(22). When stream nitrate concentrations are low 
(i.e., [NO3 ] < 10°? mol m”?) nitrate removal is 
mass transfer-limited and therefore the removal 
efficiencies can be approximated by the follow- 
ing fixed constants: Oyo, * 1 and Ogen ¥ 0.14 (Fig. 2) 
(20). For stream nitrate concentrations above this 
threshold, the results in Fig. 2 imply that nitrate 
uptake is rate-limited by nitrogen cycling and 
transport within the streambed rather than by 
turbulent transport of nitrogen from the stream 
to the streambed. Under these conditions, several 
options are available for estimating o;,,, and 
Oden- The simplest involves substituting into Eq. 3 
the linear correlations between log-transformed 
efficiency and log-transformed nitrate (see lines in 
Fig. 2). When applied to the entire LINX II data 
set, this approach closely reproduces empirical 
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Fig. 3. A test of the scaling law derived in this study. (A) Empirical cumulative distributions of 
the observed (symbols) and predicted (curves) fraction of nitrate removed at LINX II sites by both 
denitrification and assimilation (ftoz) or denitrification alone (fgen). Predicted values of fio¢ and faen 
were calculated from Eq. 3 after substituting the linear regression models for oto ANd Ogen (Fig. 2) 
and site-specific values of the shear velocity, stream velocity, reach length, average depth, and 
stream nitrate concentration (LINX II data tabulated in the supplementary materials). (B) Same 
data as in (A), but plotted so that the observed and predicted values of fio and fyen can be compared 
on a site-by-site basis. The diagonal line represents a one-to-one relationship. 
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Astrocyte-derived interleukin-33 


promotes microglial synapse engulfment 
and neural circuit development 


Ilia D. Vainchtein,’* Gregory Chin,’* Frances S. Cho,”” Kevin W. Kelley,’ 
John G. Miller,’ Elliott C. Chien,’ Shane A. Liddelow,°+ Phi T. Nguyen,”® 
Hiromi Nakao-Inoue,' Leah C. Dorman,” Omar Akil,® Satoru Joshita,®’° 
Ben A. Barres,® Jeanne T. Paz,” Ari B. Molofsky,”+ Anna V. Molofsky't 


Neuronal synapse formation and remodeling are essential to central nervous system (CNS) 
development and are dysfunctional in neurodevelopmental diseases. Innate immune signals 
regulate tissue remodeling in the periphery, but how this affects CNS synapses is largely 
unknown. Here, we show that the interleukin-1 family cytokine interleukin-33 (IL-33) is produced 
by developing astrocytes and is developmentally required for normal synapse numbers and 
neural circuit function in the spinal cord and thalamus. We find that IL-33 signals primarily 

to microglia under physiologic conditions, that it promotes microglial synapse engulfment, and 
that it can drive microglial-dependent synapse depletion in vivo. These data reveal a cytokine- 
mediated mechanism required to maintain synapse homeostasis during CNS development. 


euronal synapse formation depends on 

a complex interplay between neurons and 

their glial support cells. Astrocytes provide 

structural, metabolic, and trophic support 

for neurons (J, 2). Gray-matter astrocytes 
are in intimate contact with neuronal synapses 
and are poised to sense local neuronal cues. In 
contrast, microglia are the primary immune cells 
of the central nervous system (CNS) parenchyma. 
Microglia regulate multiple phases of develop- 
mental circuit refinement (3, 4), both inducing 
synapse formation (5, 6) and promoting synapse 
engulfment (7, 8), in part via complement, an 
effector arm of the innate immune system (9). 
Excess complement activity has been implicated 
in schizophrenia, a neurodevelopmental disorder 
that includes cortical gray matter thinning and 
synapse loss (JO), suggesting that microglial synap- 
se engulfment may have broad implications for 
neuropsychiatric disease. 
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Despite the emerging roles of astrocytes and 
microglia in neuronal synapse formation and 
remodeling, how they coordinate synaptic ho- 
meostasis in vivo remains obscure. Interleukin-33 
(IL-33) is an IL-1 family member with well- 
described roles as a cellular alarmin released 
from nuclear stores after tissue damage, in- 
cluding in spinal cord injury (11, 12), stroke (73), 
and Alzheimer’s disease (74). Whereas many cy- 
tokines are primarily defined by their roles in 
inflammation and disease (e.g., IL-1, tumor ne- 
crosis factor-a, or IL-6), IL-33 also promotes 
homeostatic tissue development and remodeling 
(15). The CNS undergoes extensive synapse re- 
modeling during postnatal brain development, 
but a role for IL-33 or other stromal-derived cy- 
tokines is unknown. Here, we report that IL-33 
is produced postnatally by synapse-associated 
astrocytes, is required for synaptic development 
in the thalamus and spinal cord, and signals to 
microglia to promote increased synaptic engulf- 
ment. These findings reveal a physiologic require- 
ment for cytokine-mediated immune signaling in 
brain development. 

We previously developed methods to identify 
functionally heterogeneous astrocytes by expres- 
sion profiling of distinct CNS regions (6). In an 
RNA-sequencing screen of developing forebrain 
astrocytes (P9) (flow sorted using an AldhIll ere 
reporter), we identified the cytokine IL-33 as a 
candidate that is both astrocyte-enriched and 
heterogeneously expressed by astrocytes through- 
out the CNS (fig. S1, A to C). We confirmed 
astrocyte-specific developmental expression 
of IL-33 in spinal cord and thalamus using a 
nuclear-localized IL-33 reporter (1133’"(?""’*) 
(Fig. 1A) and validated these findings with flow 
cytometry and protein immunostaining (fig. S2). 
By adulthood, a subset of oligodendrocytes also 
colabeled with IL-33 (fig. S2, C to E), consistent 
with previous reports (17). Thus, astrocytes are 
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the primary source of IL-33 during postnatal 
synapse maturation. 

Although most IL-33-positive cells were astro- 
cytes, not all developing astrocytes expressed 
IL-33, and this number increased in the early 
postnatal period (fig. S3) (77). In fact, IL-33 was 
detected only in gray matter, where most synaps- 
es are located (Fig. 1B and figs. S2H and S3D). 
In the thalamus, which receives regionally dis- 
tinct sensory synaptic inputs, IL-33 expression 
in the visual nucleus (ALGN) increased coincident 
with eye opening [postnatal days 12 to 14 (P12 
to P14)] (Fig. 1, C and D). Removal of afferent 
sensory synapses by enucleation at birth pre- 
vented this developmental increase in IL-33 ex- 
pression (Fig. 1, E and F), whereas dark rearing, 
in which synapse maturation is largely preserved 
(18), had no effect. Molecular profiling of IL-33- 
positive astrocytes in both thalamus and spinal 
cord (Fig. 1, G and H) revealed a negative cor- 
relation with white-matter astrocyte markers (Gfap 
and Vimentin), enrichment for genes involved in 
astrocyte synaptic functions (connexin-30/Gjb6) 
(19), and enrichment in G protein-coupled and 
neurotransmitter receptors (e.g., Adora2b and 
Adra2a) (tables S1 to S3 and data S1). Together, 
these data demonstrate that IL-33 expression is 
correlated with synaptic maturation and marks a 
subset of astrocytes potentially sensitive to syn- 
aptic cues, raising the question of whether IL-33 
plays a role in synapse development. 

To determine whether IL-33 regulates neu- 
ral circuit development and function, we ex- 
amined the effect of IL-33 deletion on synapse 
numbers and circuit activity. In the thalamus, 
a region with high IL-33 expression, an intra- 
thalamic circuit between the ventrobasal nucleus 
(VB) and the reticular nucleus of the thalamus 
(RT) displays spontaneous oscillatory activity that 
can also be evoked by stimulating the internal 
capsule that contains cortical afferents (20, 21). 
We quantified this oscillatory activity in slices 
from young adult mice (P30 to P40), which 
revealed enhanced evoked activity in response 
to stimulation (Fig. 2, A and B, and fig. $4, A 
and B), as well as elevated spontaneous firing 
in the absence of IL-33 (Fig. 2C and fig. S4C). 
This increase could result at least in part from 
enhanced numbers of glutamatergic synapses. 
To investigate this hypothesis, we performed 
whole-cell patch-clamp recordings of VB neurons 
to quantify miniature excitatory postsynaptic 
currents (mEPSCs) (Fig. 2D). We found that 
the frequency of mEPSCs was enhanced in 
VB neurons from IL-33-deficient mice, where- 
as the amplitude and the kinetics were un- 
changed (fig. S4D). Together, these results suggest 
that IL-33 deficiency leads to excess excitatory 
synapses and a hyperexcitable intrathalamic 
circuit. 

In the spinal cord, a-motor neurons (o-MN) 
are the primary outputs of the sensorimotor cir- 
cuit and receive inputs from excitatory (VGLUT2"*) 
and inhibitory (VGAT*) interneurons (Fig. 2E) 
(22). We conditionally deleted IL-33 from astro- 
cytes (AGFAPcre) (16) (fig. S5A) and found in- 
creased numbers of excitatory and inhibitory 
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inputs onto a-MN at P30; global deletion of 
Ilirll (ST2) (Fig. 2, F to I) or 1133 (fig. S5, B and 
C) phenocopied this finding. Neuronal soma 
size, interneuron numbers, and oligodendro- 
cyte numbers were unchanged (fig. S5, D to F). 
However, by adulthood, IL-33 deficiency led 
to increased gray-matter expression of glial 
fibrillary acidic protein (GFAP) (fig. S5, G and 
H), a marker of tissue stress. We also found 
that 133°- animals had deficits in acoustic 
startle response, a sensorimotor reflex mediated 
by motor neurons in the brainstem and spinal 
cord (Fig. 2, J and K) (23). Auditory acuity and 
gross motor performance were normal (fig. S5, 
I and J). Taken together, these data demon- 
strate that IL-33 is required for normal synapse 
numbers and circuit function in the thalamus 
and spinal cord. 
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Fig. 1. IL-33 is developmentally induced in synapse-associated 
astrocytes. (A) Representative image of II33™°"°"Y with Aldh1l1o"? 
astrocytes and oligodendrocyte marker CC1 in spinal cord ventral horn 
(scale bar, 50 wm). (B) Gray matter restricted expression of //33!*° in the 
spinal cord at P30 (scale bar, 0.5 mm). (€ and D) 1/33'*°7 increases in 
the visual thalamus (dLGN) during eye opening, normalized to sensorimotor 
thalamus (VB) (scale bar, 0.5 mm). (E) Representative images of 1133! in 
P21 thalamus in littermate controls and after perinatal enucleation (scale bar, 
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To determine the cellular targets of IL-33 sig- 
naling, we first quantified expression of its ob- 
ligate co-receptor ILIRL1 (ST2) (75). We detected 
Ilirll in microglia by RNA sequencing (7.1 + 2.1 
fragments per kilobase of transcript per million 
mapped reads) and by quantitative polymerase 
chain reaction (qPCR), in contrast to astrocytes, 
neurons, or the lineage-negative fraction (Fig. 3A). 
The transcriptome of acutely isolated microg- 
lia from 11337~ animals revealed 483 signifi- 
cantly altered transcripts, including reduced 
expression of nuclear factor «B (NF-«B) targets 
(e.g., Tnf, Nfkbia, Nfkbiz, and Tnfaip3) (Fig. 3, 
B and C; fig. S6A; and data S2), consistent with 
diminished NF-«B signaling (24). The transcrip- 
tome of /133°/~ astrocytes was unchanged (fig. 
S6B), arguing against cell-autonomous roles 
of IL-33 in this context. These data demon- 
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strate physiologic signaling by IL-33 to microglia 
during brain development, raising the ques- 
tion of whether it promotes physiologic microg- 
lial functions. 

Given the increased synapse numbers in IL-33- 
deficient animals, we investigated whether IL-33 
is required for microglial synapse engulfment. 
We detected engulfed PSD-95* synaptic puncta 
within spinal cord microglia throughout devel- 
opment, as in other CNS regions (7, 8), and found 
decreased engulfment in microglia from 11337— 
animals (P15) (Fig. 3D). This was further validated 
by dye labeling of spinal cord motor neurons, 
which revealed fewer dye-filled microglia in 
11337~ (fig. $7). Conversely, local injection of 
IL-33 increased PSD-95 within microglia in 
both spinal cord (Fig. 3E) and thalamus (fig. 
S8, A and B) and altered markers consistent 
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0.5 mm). (F) /133'*°7 mean pixel intensity in dLGN. (G) Representative flow 
plot of spinal cord from 133°" /Aldh11¢"" mice at P15 with sorting 
gates indicated. (H) Heat map of the top 444 differentially expressed 
genes in II33-mCherry* versus mCherry astrocytes in spinal cord and 
thalamus (fold change > 2; adjusted P value < 0.05), select candidates 
highlighted. One-way analysis of variance (ANOVA) with Tukey’s post hoc 
comparison or Student's t test. All points represent independent biological 
replicates. *P < 0.05, ****P < 0.0001. 
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Fig. 2. IL-33 deficiency A 
leads to excess synapses 


stimulating 
electrode 


Evoked C 


33t/* 1133 


and abnormal thalamic 
and sensorimotor cir- 
cuit function. (A) Sche- 
matic of extracellular 
recording setup to 
measure circuit activity 
between ventrobasal 
(VB) and reticular tha- 
lamic (RT) nuclei, with 
representative recording 
showing activity in five 
channels after stimula- 
tion of the internal D 
capsule (i.c.) that 
contains cortical affer- 
ents. Red arrows indicate 
reciprocal VB-RT connec- 
tions. (B) Average traces 
and quantification of 
mean firing rates reveal 
higher evoked firing in Wa angel 
1I33-“. (C) Quantification a) 
of mean firing rates in the 
absence of stimulation 
reveals increased 
spontaneous firing in 
[33--. (D) Representative 
traces and quantification 
of intracellular patch- 
clamp recordings 

from neurons in the 

VB show increased mini- 
ature excitatory post- 
synaptic currents 
(mEPSCs) in 1/33. 

(E) Schematic of motor 


multichannel 
recording 
electrode 


VGLUT2/ 
Chat*/DAPI 


neuron synaptic afferents. H 

(F and G) Representative 

image and quantification a 

of excitatory inputs — 
Ea 

per motor neuron after be a 

conditional deletion S % 

of 1133 (hGFAPcre) or ra] 


global deletion of //Ir/1. 

H and I) Inhibitory (VGAT*) 
inputs in the same mice 
scale bar, 25 um). 

J) Schematic of startle 
pathway. (K) Impaired 


mean + SEM. 


with microglial activation, including ILIRL1- 
dependent down-regulation of P2Y12 (fig. S9, 
A to C) (25). In vitro, IL-33 promoted synapto- 
some engulfment by purified microglia, whereas 
the canonical IL-1 family member IL-1f had no 
effect (fig. S9, D and E). In vivo, injection of IL- 
33 into the developing spinal cord led to twofold 
depletion of excitatory synapses (colocalized 
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VGLUT2/PSD-95), whereas conditional deletion 
of ILIRL1 from microglia partly reversed this 
effect (Cx3cricre:Ilrll”) (Fig. 3, F and G). In 
comparison, global loss of Jlirl1 completely 
reversed IL-33-dependent synapse depletion in 
spinal cord (fig. S10) and thalamus (fig. S8, C and 
D), suggesting that nonmicroglial sources of ST2 
could also contribute. These data indicate that 
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sensorimotor startle in I/33-~ animals. Data in (B) and (C) from wild type: n = 6 to 8 slices, 2 mice. Knockout (KO): n = 14 to 15 slices, 3 mice; points 

are individual recordings. Data in (B) analyzed by Mann-Whitney and (C) with Student's t test. Data in (D) from n = 23 to 25 cells and 3 to 4 mice per 
group, analyzed by Kolmogorov-Smirnov test. Data in (F) to (I) from n = 3 animals, >75 neurons per genotype, Student’s t test; points are individual 
neurons. (K) is n = 12/group, two-way ANOVA with Sidak’s multiple comparisons. *P < 0.05, **P < 0.01, ****P < 0.0001. (B) to (I) are mean + SD; (K) is 


IL-33 regulates synapse numbers in vivo at least 
in part via ILIRL1 receptor-mediated signaling 
in microglia. 

Our data reveal a mechanism of astrocyte- 
microglial communication that is required for 
synapse homeostasis during CNS development. 
We propose that astrocyte-derived IL-33 serves 
as a rheostat, helping to tune microglial synapse 
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Fig. 3. IL-33 drives microglial synapse engulfment during develop- 
ment. (A) Expression of //Ir/1 by qPCR of flow-sorted populations 

(S, spinal cord; T, thalamus). (B) A total of 484 differentially expres- 
sed genes in spinal cord microglia at adjusted P value < 0.05. 

(C) Functionally associated gene clustering (STRING) identifies 
immune genes enriched in wild-type versus //33-7~ microglia. 

(D) PSD-95 puncta within microglia (yellow arrows) after IL-33 
deletion (scale bar, 4 uM). (E) Representative image and quantifica- 
tion of engulfed PSD-95 in vehicle or IL-33 injected spinal cord 


engulfment during neural circuit maturation and 
remodeling (fig. S11). Key unanswered questions 
include the nature of the cues that induce astro- 
cyte 1133 expression, the mechanism of IL-33 
release, and the signals downstream of IL-33 that 
promote microglial function. These data also 
raise the broader question of how this process 
affects neural circuit function. Synapses are the 
most tightly regulated variable in the develop- 
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are mean + SD. 


ing CNS (26) and are a primary locus of dys- 
function in neurodevelopmental diseases. [133 
is one of five genes that molecularly distinguish 
astrocytes from neural progenitors in the devel- 
oping human forebrain (27), suggesting possibly 
conserved roles in the human CNS. Defining 
whether signals like IL-33 are permissive or 
instructive, promiscuous or synapse specific, 
is a first step toward understanding how neural 
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(scale bar, 20 um). (F and G) Colocalized pre- and postsynaptic 
puncta in spinal cord ventral horn at P14 (yellow arrows) after 

IL-33 injection into control mice or in littermates with conditional 
deletion of //1rll (Cx3crlcre) (scale bar, 3 wm). Points in (A) represent 
mice; in (D) to (G), individual microglia from n = 3 to 5 animals 

per group; in (G), images from n = 5 mice per group. In (D) and 

(E), Student’s t test and in (G), a one-way ANOVA with Tukey’s post 
hoc comparison. *P < 0.05, ***P < 0.001, ****P < 0.0001; all data 


circuits remodel during development and under 
stress. 
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CIRCADIAN RHYTHMS 


Rev-erboa dynamically modulates 
chromatin looping to control 
circadian gene transcription 


Yong Hoon Kim,»”** Sajid A. Marhon,””?* Yuxiang Zhang,” David J. Steger,»” 


Kyoung-Jae Won,””*?+ Mitchell A. Lazar?”*+ 


Mammalian physiology exhibits 24-hour cycli 


city due to circadian rhythms of gene 


expression controlled by transcription factors that constitute molecular clocks. Core 
clock transcription factors bind to the genome at enhancer sequences to regulate 
circadian gene expression, but not all binding sites are equally functional. We found 
that in mice, circadian gene expression in the liver is controlled by rhythmic 
chromatin interactions between enhancers and promoters. Rev-erba, a core repressive 
transcription factor of the clock, opposes functional loop formation between 


Rev-erbo-regulated enhancers and circadian 


target gene promoters by recruitment of 


the NCoR-HDAC3 co-repressor complex, histone deacetylation, and eviction of the 
elongation factor BRD4 and the looping factor MED1. Thus, a repressive arm of the 


molecular clock operates by rhythmically mo 
circadian gene transcription. 


ircadian rhythms of mammalian physiology 
are orchestrated by core clock transcription 
factors (TFs) functioning as activators or 
repressors in interlocking feedback loops 
that drive daily oscillations of gene expres- 
sion (J, 2). These TFs generate circadian rhythms 
of histone modification at circadian enhancers 
(3-6). Enhancer activities are mediated by loop- 
ing to promoters within insulated topological- 
ly associating domains (TADs) (7-9). Previous 


Fig. 1. Circadian sub- 
TADs undergo rhythmic 
intra-TAD compaction 
within stable bound- 
aries. (A) Heat maps of 
ZT22 and ZT10 Hi-C 
demonstrating circadian 
intra-TAD interactions 
within sub-TAD bounda- 
ries (dashed lines), as 
represented by Hi-C 
intensity normalized by 
iterative correction and 
eigenvector decomposi- 
tion (ICE). The transcrip- 
tional start site of the 


dulating chromatin loops to control 


studies with cultured cells have described cir- 
cadian regulation of higher-order chromatin 
organization (10), long-range interchromosomal 
interactions (J), and short-range chromatin 
loops (72) at specific loci, but this has not been 
examined at a genome-wide level in native tis- 
sues under normal physiology. 

We performed in situ Hi-C (13, 14) on C57BL/6J 
mouse livers harvested 12 hours apart, at zeitgeber 
time 22 (ZT22, 5 a.m.) and ZT10 (5 p.m.), to ex- 


Npas2 gene (TSS) forms 


ou E1 E2 E3 E4 TSS Gene body 
rhythmic intra-TAD loops 5’ start 2722 3’ end 


with upstream enhancers 
(E1 to E4) as well as 

with the gene body, as il- 
lustrated by schematics 
below. (B and C) ZT22 
sub-TAD (B) and ZT10 
sub-TAD (C) averaged dif- 
ferential changes in intra- 
TAD interactions visualized 


amine whether chromatin interactions change 
in a circadian manner. Hi-C identified megabase- 
size TADs whose boundaries were highly similar 
to those identified in mouse embryonic stem 
cells (fig. SIA). This was expected because TADs 
are largely conserved among different tissues (7), 
and indeed the overall TAD organization was also 
highly similar between ZT22 and ZT10 (fig. S1B). 
Within each TAD we also observed submegabase 
structures (sub-TADs) of different lengths (fig. 
S1C) that were flanked by CTCF and cohesin 
(RAD21) and internally occupied by Mediator 
(MEDI) (fig. SID), as described previously (14-17). 
Globally, genomic occupancy of CTCF and cohesin 
at ZT10 versus ZT22 was very similar at sub-TAD 
boundaries (fig. S1, E and F). 

Because TFs bind at enhancers to regulate 
genes confined within sub-TAD boundaries (16), 
we next searched for “intra-TAD” interactions 
occurring within sub-TADs. Of 6510 intra-TAD in- 
teractions, only 349 were ZT22-specific, whereas 
527 were ZT10-specific (Wilcoxon signed-rank test, 
P < 0.001). For example, the circadian Npas2 gene 
exhibited increased intra-TAD interactions at 
ZT22, including extensive looping between the 
transcriptional start site and four noncoding re- 
gions (E1 to E4) (Fig. 1A). These noncoding regions 
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mapped to regions of divergent transcriptional 
activity characteristic of enhancer RNA (eRNA) 
that were similarly regulated (fig. S2A). Intra-TAD 
interaction was also observed within the gene 
body, as previously reported at actively transcribed 
genes (18, 19), and this was enhanced at ZT22 
as well (Fig. 1A). 

Genome-wide, we identified hundreds of sub- 
TADs containing circadian genes whose expres- 
sion has been shown to peak between ZT21 and 
Z124 (“ZT22 sub-TADs”) or between ZT9 and 
Z112 (“ZT10 sub-TADs”) (6). The overall structure 
of these sub-TADs (fig. S2B) and the binding of 
CTCF and RAD21 at their boundaries (fig. S2, C 
and D) changed very little between ZT22 and 
Z7110 (fig. S2, B to D). However, ZT22 and ZT10 
sub-TADs exhibited greater intra-TAD interac- 
tions corresponding to their transcriptional ac- 
tivities (Fig. 1, B and C), whereas noncircadian 
sub-TADs did not (fig. S3A). Similar conclusions 
were obtained when the circadian windows 
were adjusted by 1 hour (fig. S3, B and C). Inter- 
actions within gene bodies were also circadian 
(fig. S3, D to H). 


The circadian Cry] locus is located within a 
Z1T22 sub-TAD, and Hi-C revealed an interaction 
between the gene promoter and an intronic en- 
hancer that was increased at ZT22 (fig. S4, A 
and B), as best visualized by differential analysis 
of the data (Fig. 2A). The nuclear receptor Rev- 
erba, a repressive component of the mammalian 
clock whose expression peaks at ZT10 to confer 
circadian expression of genes in the opposite 
phase (6, 20, 21), binds at this intronic enhancer 
(Fig. 2A). The enhancer-promoter (E-P) loop 
identified in ZT22 Hi-C was confirmed by chro- 
matin conformation capture (3C) experiments 
(Fig. 2B). Tandem 3C and Rev-erba chromatin 
immunoprecipitation (ChIP) at six time points 
throughout the day revealed that this E-P loop 
was indeed circadian and in phase with Cry1 
mRNA expression, with a peak that was antiphase 
to Rev-erbo. binding (Fig. 2C). At ZT10, both loop- 
ing from the Reyv-erbo. site to the CryJ promoter 
(Fig. 2D) and CryI gene expression (Fig. 2E) were 
enhanced by knockout (KO) of Rev-erba, which 
attenuated the rhythmicity of these parameters 
over the course of 24 hours (fig. S4C). Recip- 


rocally, ectopic expression of Rev-erba in liver 
was sufficient to reduce looping as well as 
mRNA expression at ZT22 (Fig. 2, F and G), con- 
sistent with an active role of Rev-erba in opposing 
loop formation. We next performed Hi-C on livers 
from mice genetically lacking Rev-erbo. and har- 
vested at ZT10, which confirmed the enhanced 
E-P looping at the CryI locus (fig. S4D). More- 
over, throughout the genome, intra-TAD inter- 
actions that were normally favored at ZT22 were 
increased in the genetic absence of Rev-erbo. at 
ZT10 (Fig. 2H). 

These findings suggested that the ability of Rev- 
erbo to oppose loop formation is a critical feature 
at binding sites from which Rev-erba actively 
represses transcription. To explore the relation- 
ship between regulation of looping and functional 
repression, we identified Rev-erbo binding sites 
that directly loop to promoters at ZT22 in the 
physiological absence of Rev-erba. E-P loops at 
Rev-erba binding sites were defined as “engaged” 
when transcription of the gene body was repressed 
at ZT10, or “passive” if not repressed (Fig. 3, A 
and B). Engaged sites were highly correlated 
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Fig. 2. Rev-erba causally opposes enhancer-promoter loop formation. 
(A) Differential Hi-C analysis at the Cry1 locus revealing ZT22-specific 
interactions, represented as logs ratio (ZT22 Hi-C/ZT10 Hi-C). ZT22- 
specific interactions (dashed circle) occur between a region around the 
intronic Rev-erba site (red) and the Cryl TSS (blue). Global run-on 
sequencing (GRO-seq) demonstrates circadian nascent transcription 
as well as the presence of bidirectional eRNA at the Rev-erba site at 
Z122. (B) 3C validation of enhancer-promoter loop (E-P loop) identified 
at ZT22 between the Rev-erba site (red) and TSS (blue) (n = 5, 

mean + SEM). (C) Circadian plot demonstrating Cryl E-P loop, mRNA 
expression, and Rev-erba ChIP + SEM [n = 4 or 5, P values shown for 3C 
and ChIP peaks compared to troughs, one-way analysis of variance 
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Z1T22 sub-TAD average 


Rev-erba site 


TSS (bait) 


Log,(aKO/ZT10) Hi-C 
normalized differential 
1.0 05 00 -05 -1.0 


a Higher intensity at 
“es  <akO | 2T10> 


Rev-erba. ChIP % input 


3’ end 


13.17 21 


(ANOVA) followed by multiple-comparisons correction with the Tukey 
method]. (D and E) E-P loop (D) and mRNA expression (E) of Cryl at 
ZT22 (black), ZT10 (white), and ZT10 Rev-erba knockout (KO) (red), 
represented as mean + SEM (n = 4, one-way ANOVA followed by 
Dunnett multiple-comparisons test). (F and G) E-P loop (F) and mRNA 
expression (G) of Cryl at ZT22 with control green fluorescent protein 
(GFP) expression (black) versus Rev-erba overexpression (blue) 
expressed as mean + SEM (n = 5, two-tailed Student t test). 

(H) Same analysis as in Fig. 1B, but comparing ZT10 Rev-erba KO 
(aKO) to wild-type ZT10 at ZT22 sub-TADs (red, higher interaction ratio 
at ZT10 aKO; blue, higher interaction ratio at wild-type ZT10). *P < 0.05, 
SEP O01 ***P <= O1001,.****P:< 00001. 
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Fig. 3. Rev-erbo. attenuates enhancer-promoter looping at functional 
binding sites. (A) Rev-erba sites at E-P loops were classified as engaged 
when looped to genes whose transcription was repressed at ZT10 relative 
to ZT22, and otherwise classified as passive. (B) Gene body transcription 
change between ZT22 and ZT10 at engaged versus passive sites. Engaged 
target genes were defined as those displaying gene body transcription 
relative change of 21.5 between ZT22 and ZT10. (C) Engaged Rev-erboa 
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Fig. 4. Functional Rev-erba binding evicts BRD4 and MED1 from sites of looping. (A) Greater recruitment 
of NCoR1 and HDAC3 at engaged Rev-erba sites associated with a slight average increase in Rev-erba binding 
(Mann-Whitney tests). (B) Circadian deacetylation of H3K27Ac at circadian time 21 (CT21) and CT9 at engaged 
versus passive sites [data sets from (5)]. (C) Circadian eRNA transcription between ZT22 and ZT10, with 
increased transcription at ZT10 in aKO at engaged sites [data sets from (6)]. (D and E) Circadian eviction of 
BRD4 (D) and MED1 (E) between ZT22 and ZT10, with enhanced binding at ZT10 in aKO at engaged sites (Dunn 
multiple-comparisons tests after one-way ANOVA/Friedman test). (F and G) ChIP-qPCR validation of BRD4 (F) 
and MED1 (G) eviction at ZT10 and enhanced binding at ZT10 in aKO at engaged sites (Ins as a negative control, 
n = 3 or 4, two-way ANOVA followed by Dunnett multiple-comparisons test). For box plots in (A), (D), and (E), 
whiskers denote the 10th and 90th percentiles. *P < 0.05, **P < 0.01, ****P < 0.0001; ns, not significant. 
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sites were highly correlated with ZT18-24 circadian eRNAs (6) (within 
+2 kbp, one-tailed hypergeometric tests). (D) Engaged Rev-erba sites 
were confined within sub-TADs that contain circadian genes peaking at 
Z118-24 (one-tailed hypergeometric tests). (E) E-P loops between en- 
gaged Rev-erba binding sites and target gene promoters were stronger 
at ZT22 than at ZT10 (Mann-Whitney test). For box plots in (B) and 
(E), whiskers denote the 10th and 90th percentiles. 


with circadian eRNAs whose activity 
peaked around ZT18 to ZT24 (ZT18- 
24) antiphase to Rev-erbo. binding (Fig. 
3C), demonstrating that engaged sites 
are direct links between circadian 
eRNAs and genes repressed by Rev- 
erba. De novo motif analysis at en- 
gaged sites revealed enrichment of 
DNA motifs known to be bound by 
Rev-erbo. directly (RORE and RevDR2) 
(22) or indirectly (HNF6) (27) (fig. S5A), 
whereas these motifs were not enriched 
at passive sites (fig. S5B). Although 
the number of sub-TADs containing 
circadian genes from each phase was 
similar (fig. S5C), only engaged Rev- 
erbo sites were enriched in ZT18-24 
circadian sub-TADs (Fig. 3D and fig. 
S5D). Most important, E-P interactions 
at engaged Rev-erba binding sites were 
strengthened at ZT22 relative to ZT10 
(Fig. 3E). Together, these findings 
show that the ability of Rev-erba to 
oppose E-P loops within sub-TADs is 
a likely determinant of active repres- 
sion by Rev-erba. 

We next addressed the mechanism 
by which Rev-erba binding controls cir- 
cadian E-P interactions. The repressive 
action of Rev-erba is often mediated 
by recruitment of a co-repressor com- 
plex containing NCoR (nuclear recep- 
tor co-repressor) and HDAC3 (histone 
deacetylase 3) (23), leading to circadian 
histone deacetylation associated with 
repressed enhancers (3). Indeed, recruit- 
ment of NCoR and HDAC3 was greater 
at engaged Rev-erbo. binding sites, with 
a modest average increase in Rev-erbo. 
binding relative to passive sites (Fig. 4A). 
Consistent with this finding, the previ- 
ously demonstrated circadian acetyla- 
tion of histone H3 Lys?’ (H3K27Ac) in 
mouse liver (5) was exaggerated at 
engaged Rev-erbo binding sites but 
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was nearly absent at passive sites (Fig. 4B). eRNA 
transcription also demonstrated an enhanced 
circadian rhythm at engaged sites, which was 
abrogated in livers lacking Rev-erba (Fig. 4C); 
this finding shows that Rev-erbo. was required 
for the epigenomic rhythms that occurred selec- 
tively at engaged sites. 

The transcriptional regulator BRD4 acts as 
a reader of acetylated histone (24-26) and forms 
a functional transcriptional complex with the 
well-established looping factor MED1 (27-30). 
In agreement with the changes in H3K27Ac at 
engaged sites, BRD4 binding was greater at ZT22 
than at ZT10, and this difference was attenuated 
in the genetic absence of Rev-erba (Fig. 4D). Sim- 
ilarly, MED1 was also evicted at engaged sites 
but not in livers lacking Rev-erba (Fig. 4E). The 
opposition of Rev-erba to the circadian bind- 
ing of BRD4 and MED1 was confirmed by ChIP- 
quantitative polymerase chain reaction (qPCR) 
in both the genetic absence (Fig. 4, F and G) 
and ectopic expression of Rev-erbo (fig. S6, A 
to C). Focusing on Rev-erba binding sites where 
BRD4 is evicted at ZT10 (fig. S6D) revealed the 
concurrent eviction of MEDI (fig. S6E) and also 
independently predicted functional sites with 
circadian eRNA transcription (fig. S6F). How- 
ever, the binding of CTCF and RAD21 was low 
and not circadian at Rev-erbo binding sites 
(fig. S6, G to J). 

Our findings demonstrate genome-wide orga- 
nizational plasticity at the level of sub-TADs that 
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occurs in a circadian manner as a component 
of normal mammalian physiology. The mech- 
anisms by which Rev-erba functionally opposes 
E-P loop formation, leading to circadian repres- 
sion of gene transcription within sub-TADs, are 
likely applicable to other transcriptional repress- 
ors whose function in controlling chromatin ar- 
chitecture is currently not as well defined as 
for transcriptional activators. 


REFERENCES AND NOTES 


. Bass, M. A. Lazar, Science 354, 994-999 (2016). 

S. Takahashi, Nat. Rev. Genet. 18, 164-179 (2017). 

. Feng et al., Science 331, 1315-1319 (2011). 

. Koike et al., Science 338, 349-354 (2012). 

. Vollmers et al., Cell Metab. 16, 833-845 (2012). 

. Fang et al., Cell 159, 1140-1152 (2014). 

R. Dixon et al., Nature 485, 376-380 (2012). 

. P. Nora et al., Nature 485, 381-385 (2012). 

. Sexton et al., Cell 148, 458-472 (2012). 

. Zhao et al., Mol. Cell 59, 984-997 (2015). 

. L. Aguilar-Arnal et al., Nat. Struct. Mol. Biol. 20, 1206-1213 
(2013). 

. Y. Xu et al., PLOS Genet. 12, 1005992 (2016). 

. E. Lieberman-Aiden et al., Science 326, 289-293 (2009). 

S. S. P. Rao et al., Cell 159, 1665-1680 (2014). 

. J. E. Phillips-Cremins et al., Cell 153, 1281-1295 (2013). 

J. M. Dowen et al., Cell 159, 374-387 (2014). 

. R. Siersbeek et al., Mol. Cell 66, 420-435.e5 (2017). 

. S. M. Tan-Wong et al., Science 338, 671-675 (2012). 

. A. R. Grosso, S. F. de Almeida, J. Braga, M. Carmo-Fonseca, 
Genome Res. 22, 1447-1456 (2012). 

20. N. Preitner et al., Cell 110, 251-260 (2002). 

21. Y. Zhang et al., Science 348, 1488-1492 (2015). 

22. H. P. Harding, M. A. Lazar, Mol. Cell. Biol. 15, 4791-4802 

(1995). 
23. L. Yin, M. A. Lazar, Mol. Endocrinol. 19, 1452-1459 (2005). 


WHONAORWNHE 
i 


ram wowzoe 


© 


SGHURGADKRDLR 


24. A. Dey, F. Chitsaz, A. Abbasi, T. Misteli, K. Ozato, Proc. Natl. 
Acad. Sci. U.S.A. 100, 8758-8763 (2003). 

25. J. S. Roe, F. Mercan, K. Rivera, D. J. Pappin, C. R. Vakoc, 
Mol. Cell 58, 1028-1039 (2015). 

26. M. K. Jang et al., Mol. Cell 19, 523-534 (2005). 

27. Y. W. Jiang et al., Proc. Natl. Acad. Sci. U.S.A. 95, 8538-8543 
(1998). 

28. J. Lovén et al., Cell 153, 320-334 (2013). 

29. S. Y. Wu, C. M. Chiang, J. Biol. Chem. 282, 13141-13145 
(2007). 

30. A. S. Bhagwat et al., Cell Rep. 15, 519-530 (2016). 


ACKNOWLEDGMENTS 


We thank M. Adlanmerini, P. Dierickx, B. Fang, D. Guan, R. Papazyan, 
R. Soccio, Y. Aubert, and the Lazar laboratory for helpful 
discussions; J. Phillips-Cremins (University of Pennsylvania) and 
P. Huang and C. Edwards (Children’s Hospital of Philadelphia) for 
technical advice; the Functional Genomics Core of the Penn 
Diabetes Research Center and the Penn Epigenetics Institute for 
next-generation sequencing; and the Viral Vector Core of the Penn 
Diabetes Research Center for recombinant virus preparation. ChIP- 
seq and Hi-C data have been deposited in the Gene Expression 
Omnibus (GSE104129). Supported by the JPB Foundation (M.A.L.) 
and by NIH grants RO1 DK45586 and P30 DK19525 (M.A.L.), ROL 
DK106027 (K.-J.W.), and T32 GM007170, T32 GM008216, and F30 
DK112507 (Y.H.K.). 


SUPPLEMENTARY MATERIALS 


www.sciencemag.org/content/359/6381/1274/suppl/DC1 
Materials and Methods 

Figs. Sl to S6 

Tables S1 to S5 

References (31-35) 


18 August 2017; accepted 17 January 2018 
Published online 8 February 2018 
10.1126/science.aao6891 


4 of 4 


810z ‘8} yore UO /Gi0 Bewseoual0s’e0ue!0S//:d}]y WO} pepeojuMOGg 


RESEARCH 


AGING 


Lysosome activation clears aggregates 
and enhances quiescent neural stem 
cell activation during aging 


Dena S. Leeman,’”” Katja Hebestreit,’* Tyson Ruetz,' Ashley E. Webb,'+ 

Andrew McKay,” Elizabeth A. Pollina,?+ Ben W. Dulken,’* Xiaoai Zhao,’ 

Robin W. Yeo,’ Theodore T. Ho,” Salah Mahmoudi,' Keerthana Devarajan," 
Emmanuelle Passegué,’§ Thomas A. Rando,®” Judith Frydman,”’® Anne Brunet” || 


In the adult brain, the neural stem cell (NSC) pool comprises quiescent and activated 
populations with distinct roles. Transcriptomic analysis revealed that quiescent and 
activated NSCs exhibited differences in their protein homeostasis network. Whereas 
activated NSCs had active proteasomes, quiescent NSCs contained large lysosomes. 
Quiescent NSCs from young mice accumulated protein aggregates, and many of these 
aggregates were stored in large lysosomes. Perturbation of lysosomal activity in 
quiescent NSCs affected protein-aggregate accumulation and the ability of quiescent 
NSCs to activate. During aging, quiescent NSCs displayed defects in their lysosomes, 
increased accumulation of protein aggregates, and reduced ability to activate. 
Enhancement of the lysosome pathway in old quiescent NSCs cleared protein aggregates 
and ameliorated the ability of quiescent NSCs to activate, allowing them to regain a 


more youthful state. 


reservation of a pristine proteome is crit- 

ical for maintaining cell function over long 
periods of time, and decline in proteome 
health is associated with aging and neuro- 
degenerative disease (/, 2). Three primary 
mechanisms are used to maintain protein homeo- 
stasis (proteostasis): molecular chaperones, the 
proteasome proteolytic system, and the lysosome- 
autophagy proteolytic system (2, 3). Proteostasis 
is particularly important for stem cell prolifer- 
ation and differentiation (4). However, somatic 
stem cell pools contain not only proliferating 
(activated) stem cells but also quiescent stem 
cells. How these different types of adult stem 
cells maintain protein homeostasis and deal 
with protein aggregation is largely unknown. 
Protein aggregates are particularly relevant 
in the brain, where their accumulation is as- 
sociated with neurodegenerative disease (J). 
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The adult brain contains pools of regenerative 
neural stem cells (NSCs) that can generate new 
neurons (neurogenesis) (5, 6). In one of the neuro- 
genic niches—the subventricular zone (SVZ)— 
a subset of quiescent astrocytes are the source of 
quiescent NSCs (qNSCs), which in turn give rise to 
activated NSCs (aNSCs) and neural progenitor 
cells (NPCs) (5, 7). These NPCs generate new 
neurons, oligodendrocytes, and astrocytes, con- 
tributing to the maintenance of olfactory func- 
tion, as well as repair after brain injury (4, 8). 
The ability of qNSCs to activate declines with 
age (9-11). Consistently, aging is associated with 
a decline in neurogenesis and a reduction in 
olfactory discrimination and memory (6, 9). Un- 
derstanding the age-dependent changes in pro- 
teostasis in the NSC pool could help identify ways 
to enhance the function of old NSCs and main- 
tain brain health. 

To gain an unbiased overview of the proteo- 
stasis network in different cell types in a re- 
generative brain region during aging, we used 
transcriptomic profiling. We aged cohorts of 
transgenic mice expressing green fluorescent 
protein (GFP) under the control of the human 
glial fibrillary acidic protein (GEAP) promoter, 
which drives expression in astrocytes and NSCs 
(7). Using fluorescence-activated cell sorting (FACS) 
with a combination of GFP and cell surface mark- 
ers, we simultaneously purified five cell types 
from the SVZ of paired young (3- to 4-month- 
old) and old (19- to 22-month-old) mice: endo- 
thelial cells (which provide key environmental 
signals to NSCs), SVZ astrocytes (hereafter, as- 
trocytes), (NSCs, aNSCs, and NPCs (Fig. 1A and 
fig. SLA) (7). Each population expressed well- 
established markers (fig. SIB) and showed ex- 
pected cell-cycle characteristics (fig. S1C). We 
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used RNA-sequencing (RNA-seq) to globally 
profile the transcriptomes of four biological 
replicates per cell type and per age (tables S1 
and S2 and fig. S2, A to C). The transcriptomes 
of endothelial cells, quiescent cells (astrocytes 
and qNSCs), and activated cells (aNSCs and 
NPCs) could be distinguished by principal com- 
ponent analysis (PCA) (Fig. 1B) and hierarchical 
clustering (fig. S2D). By contrast, young and 
old samples from the same cell type were not 
easily separated when all cell types were con- 
sidered together (Fig. 1B and fig. S2D), though 
age-dependent differences were revealed when 
cell types were analyzed individually (see below, 
Fig. 4 and fig. S8). 

Analysis of Kyoto Encyclopedia of Genes and 
Genomes (KEGG) pathways that are up-regulated 
or down-regulated between qNSCs and aNSCs 
showed differences in several homeostatic and 
metabolic pathways (e.g., lipid metabolism, DNA 
repair, cell cycle) (7, 12-15) as well as strong dif- 
ferences in the protein homeostasis network 
(Fig. 1C; figs. S3, A to D, and S4A; and tables $3 
to S6). aNSCs exhibited increased expression of 
proteasome-associated genes (Fig. 1D, fig. S4B, 
and tables S5 and S6) (7, 15). By contrast, (NSCs 
had increased expression of lysosome-associated 
genes (Fig. 1D, fig. S4B, and tables S5 and S6). 
A motif for the transcription factor EB (TFEB), a 
master regulator of lysosomes and autophagy 
(16, 17), was enriched in lysosomal genes of NSCs 
(fig. S4C). More generally, TFEB target genes were 
more highly expressed in qNSCs than in aNSCs 
(fig. S4C) (17), and TFEB was among the top five 
most significant upstream regulators of qNSC 
gene expression (fig. S4D). qNSCs and aNSCs 
also expressed different types of genes that en- 
code protein chaperones (Fig. 1D, fig. S4E, and 
table S6): endoplasmic reticulum (ER) unfolded- 
protein response genes were more highly ex- 
pressed in qNSCs, whereas the TCP-1 ring complex 
(TRiC) and prefoldin complex genes were more 
highly expressed in aNSCs (fig. S4E). Similar differ- 
ences in branches of the proteostasis network were 
observed between the transcriptomes of quiescent 
and activated stem cells of other tissues (muscle 
and hematopoietic) (fig. S4F). Thus, quiescent and 
activated stem cells mobilize different branches of 
the protein quality-control network. 

We experimentally assessed the status of the 
lysosomes and proteasomes in qNSCs and aNSCs. 
Staining for the lysosomal membrane protein 
LAMP-1 revealed that qNSCs freshly isolated 
from the adult brain had more LAMP-1 staining 
than did their activated counterparts (Fig. 1E). 
This was confirmed by staining for LAMP-1 in 
whole mounts of the SVZ (fig. S5A). Staining 
for the lysosomal membrane protein LAMP-2 
in a primary culture system for NSC quiescence 
and activation (18) showed that qNSCs not only 
had more lysosomes but also larger lysosomes 
than aNSCs (Fig. 1F and fig. S5B). The large 
qNSC lysosomes were labeled with LysoTracker, 
a fluorescent probe specific to acidic organelles 
(Fig. 1G), suggesting that these lysosomes had 
the acidity required for proteolytic activity. Lyso- 
somes can fuse with autophagosomes, thereby 
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Fig. 1. qNSCs exhibit a protein quality-control signature distinct from 
their activated progeny, with enlarged lysosomes. (A) Experimental 
design. Endo, endothelial cell; Ast, astrocyte; EGFR, epidermal growth factor 
receptor. (B) PCA of five cell types and two ages, performed on variance 
stabilizing transformation (VST)—-normalized read counts of all detected genes. 
PC, principal component. (C) Selected KEGG pathways significantly upregulated 
in NSCs or aNSCs. P values determined by bootstrap sampling, with 
Benjamini-Hochberg correction. *False discovery rate (FDR)—adjusted P < 0.05; 
**EDR-adjusted P < 0.001; ***FDR-adjusted P < 0.0001. See also fig. S3 and table 
S5. (D) Heatmap showing expression levels of all genes in pathways from the 
three main branches of proteostasis: proteasome, lysosome, and 
chaperones. VST-normalized read counts, scaled row-wise. V-ATPase, vacuolar 
H*—adenosine triphosphatase. See also table S6. (E) Median fluorescence of 
LAMP-1 normalized to cell size, as measured by flow cytometry. Mean + SEM of 
values from six different mice (3 to 4 months old) analyzed on a single day. Each 
point represents FACS-sorted cells from a single mouse. P value determined 


Leeman et al., Science 359, 1277-1283 (2018) 16 March 2018 


by one-sided Wilcoxon rank sum test. **P < 0.01. (F and G) Representative 
immunofluorescence images of primary cultures of NSCs and aNSCs 
from 3-month-old mice stained with (F) LAMP-2 (green, lysosomal 
membranes) and 4’,6-diamidino-2-phenylindole (DAPI) (blue, nuclei) or 
(G) LysoTracker (red, acidic organelles) and Hoechst (blue, nuclei). Scale bars, 
5 um. See also figs. SSB and S9B. (H) Top: In cells expressing mCherry-GFP- 
LC3, autophagosomes display both GFP and mCherry fluorescence (yellow-green), 
whereas autolysosomes display only mCherry fluorescence (red) because 

GFP is denatured by the acidity of the lysosome. Bottom: Representative images of 
GFP and mCherry fluorescence in primary cultures of (NSCs and aNSCs from 
3-month-old mice. Scale bars, 5 um. See also fig. SSC. (I) Fluorescence level of 
Me4BodipyFL-Ahx3Leu83VS, a probe that covalently binds active proteasomes, in 
populations directly isolated from the brain. Mean + SEM of values from four 
different mice (3 to 4 months old) sorted and analyzed on two different days. Each 
point represents cells from a single mouse. P value determined by one-sided 
Wilcoxon rank sum test. *P < 0.05. 
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forming autolysosomes that, in turn, degrade 
autophagosome contents (3). Using a system in 
which LC3 (a marker of autophagosomes) is fused 
to both GFP and mCherry (Fig. 1H) (19), we found 
that qNSCs contained numerous large lysosomes 
that had fused with autophagosomes to form 
autolysosomes but had not yet degraded their 
contents (Fig. 1H and fig. S5C). Consistently, 
NSCs freshly isolated from the SVZ showed 
less accumulation of LC3 than did aNSCs in 
response to inhibitors of lysosomal acidification 
(bafilomycin A and chloroquine) (fig. S5D), indic- 
ative of slower degradation of autophagosomes 
by qNSC lysosomes. Thus, although the lysosomes 
of qNSCs are acidified, they degrade their con- 
tents relatively slowly at steady state. 

We then tested whether proteasome activity 
was different between qNSCs and aNSCs. As pre- 
dicted by our RNA-seq data, qNSCs directly iso- 
lated from the brain had reduced staining for 
catalytically active proteasomes (Fig. 1I) (20) and 
conjugated ubiquitin (fig. SSE) compared to that 
of aNSCs. Furthermore, lysates from primary cul- 
tured qNSCs had less proteasome activity than 
did lysates of aNSCs (fig. SSF). Thus, qNSCs and 
aNSCs use different branches of the proteostasis 
network, with qNSCs exhibiting large lysosomes 
and aNSCs displaying active proteasomes. 


Fig. 2. qNSCs have A 
insoluble protein 
aggregates localized 
in large lysosomes. 
(A) Representative 
immunofluorescence 
images of qNSC and 
aNSC primary cultures 
from 3-month-old 
mice stained with 
Proteostat (magenta, 
protein aggregates), 
LAMP-2 (green, lyso- 
somal membranes), 

and DAPI (blue, nuclei). 
Scale bars, 5 um. 

(B) Median fluorescence 
of Proteostat normalized 
to cell size and to the 
median value on the day 
of measurement (to 
combine independent 
experiments). Mean + 
SEM of values from 

10 different mice (3 to 
4 months old) analyzed 
by flow cytometry on 
four different days. 

Each point represents 
FACS-sorted cells from a 
single mouse. P value 
determined by two-sided 
Wilcoxon rank sum test. 
**P < 0.01. (C) Repre- 
sentative SDS-PAGE gel 
stained with SYPRO 
Ruby for detection of 


qNsc 


The differences between protein quality-control 
pathways in qNSCs and aNSCs raise the question 
of each cell type’s ability to regulate proteome 
health (fig. S6A). We stained young adult qNSCs 
and aNSCs freshly purified from the brain and 
in primary culture with Proteostat dye, which 
becomes highly fluorescent upon binding to the 
amyloid-type B-sheet tertiary structure of protein 
aggregates (27). NSCs, even from young adults, 
exhibited increased Proteostat staining com- 
pared to that of aNSCs (Fig. 2, A and B). The 
accumulation of protein aggregates in young 
qNSCs was surprising, given that cellular quies- 
cence is accompanied by reduced protein syn- 
thesis (74, 15), and indeed, freshly isolated qNSCs 
had less protein synthesis than did aNSCs (fig. 
S6B). A large proportion of protein aggregates 
in qNSCs were contained within lysosomes (Fig. 
2A), consistent with the observation that qNSC 
lysosomes degrade the contents of autophago- 
somes more slowly. 

Biochemical fractionation of lysates from 
qNSCs and aNSCs to enrich for protein aggre- 
gates confirmed that qNSCs contained signifi- 
cantly more insoluble protein aggregates than 
did aNSCs (Fig. 2C and fig. S6, C to E). The pres- 
ence of proteins at a variety of molecular weights 
in the qNSC insoluble fraction suggests that 


aNSC B 


protein in the total, soluble, and insoluble fractions of GNSCs and aNSCs. See also fig. S6, C to E. 
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QNSC aggregates contain a wide array of pro- 
teins (Fig. 2C). Additionally, accumulation of 
qNSC insoluble proteins at the top of the SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) 
gel is consistent with the presence of highly or- 
dered amyloid-like aggregates (22) (Fig. 2C and 
fig. S6E). Thus, NSCs contain more protein ag- 
gregates than their activated counterparts, and 
this aggregation appears to affect many proteins. 
The accumulation of protein aggregates in healthy 
young adult qNSCs raises the possibility that these 
aggregates play a physiological role. 

The primary function of qNSCs is to generate 
aNSCs in response to growth factor signals. We 
tested whether manipulations that regulate lyso- 
somal activity could affect protein aggregates in 
qNSCs and, in turn, influence activation of qNSCs 
in response to activation signals [epidermal growth 
factor (EGF) and basic fibroblast growth factor 
(bFGF)]. Treatment with bafilomycin A, which 
blocks lysosomal acidification, led to increased 
accumulation of protein aggregates (Fig. 3, A 
and B, and fig. S7A) and to decreased activation 
in response to growth factors (Fig. 3C). To in- 
duce lysosomal degradation, we deprived cells 
of nutrients by incubating them in Hank’s bal- 
anced salt solution (HBSS) (23). Nutrient dep- 
rivation of qNSCs led to a significant decrease 
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in lysosomal staining (fig. S7B) and in the amount 
of protein aggregates (Fig. 3, A and D, and fig. S7, 
C and D). A brief pulse of nutrient deprivation 
in qNSCs enhanced their subsequent activation 
by growth factors, and this was abolished by 
bafilomycin A (Fig. 3E). Enhanced qNSC activa- 
tion did not result from direct effects of nutrient 
deprivation on proliferation: Nutrient depriva- 
tion did not affect proliferation of qNSCs in the 
absence of growth factors, and indeed, it de- 
creased proliferation of aNSCs (fig. S7E). To ac- 
tivate lysosome function more selectively, we 
used expression of TFEB—a master regulator of 
lysosome biogenesis and autophagy (J6, 17). Brief 
expression of an activated form of TFEB in qNSCs 
(24) (fig. S7F) led to decreased abundance of 
protein aggregates in qNSCs (Fig. 3F) and to en- 
hanced qNSC activation in response to growth 
factors (Fig. 3G). Thus, activating the lysosome- 
autophagy pathway in qNSCs cleared protein 
aggregates and improved qNSC activation in 
response to growth factors. 

The collapse of protein homeostasis is a key 
feature of aging in differentiated cells (1). To 
understand how aging affects protein homeo- 
stasis in stem cells, we analyzed our RNA-seq 
data sets for changes in global transcriptional 
profiles as well as specific transcriptional sig- 
natures in qNSCs and aNSCs freshly purified 
from old (19- to 22-month-old) versus young 
(3- to 4-month-old) mice (Fig. 1A). Quiescent cell 
types (qNSCs and astrocytes) displayed more 
and larger-magnitude transcriptional changes 
with age than did their activated counterparts 
(aNSCs and NPCs) (Fig. 4A, fig. S8A, and tables 
87 and S8). This was not caused by technical 
variability between biological replicates (fig. S8B) 
and was also observed with a different transcript 
quantification and analysis method (fig. $8, C 
and D). Consistently, the transcriptome of young 
and old qNSCs and astrocytes could be dis- 
tinguished by PCA, whereas that of young and 
old aNSCs and NPCs could not (fig. S8, E and F). 
The age-associated differences in qNSCs could 
reflect changes that arise from a heterogeneous 
population of qNSCs (5, 17, 15). The expression 
of genes associated with lysosome function, 
but not proteasome function, was significantly 
changed with age in qNSCs but not in aNSCs 
(fig. S8G). Thus, qNSCs undergo more tran- 
scriptional changes with age than do aNSCs and 
NPCs, and the expression of lysosome genes 
changes with age. 

We experimentally examined the status of lyso- 
somes and proteasomes in old qNSCs and aNSCs. 
Old qNSCs in their in vivo niches or freshly pu- 
rified from the brain exhibited less lysosomal 
staining (LAMP-1) than young qNSCs (Fig. 4, 
B and C), and this decrease was significantly 
greater than that observed with age in aNSCs 
(Fig. 4C). This reduction in lysosomal staining 
was also observed in primary qNSC cultures from 
old mice (fig. S9, A and B). The age-dependent 
decrease in lysosome levels could result in fewer 
lysosomes available to fuse with autophago- 
somes (and/or endosomes). Indeed, using old 
GFP-LC3 reporter mice, we identified a distinct 
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Fig. 3. Modulation of lysosomal activity in qNSCs affects protein aggregates and the transition 
from quiescence to activation. (A) Representative immunofluorescence images of primary cultures 
of qNSCs from 3-month-old mice after 18 hours of treatment with 50 nM bafilomycin A (BafA, a 
lysosomal V-ATPase inhibitor) or 3 hours of nutrient deprivation in HBSS. Cells are stained with 
Proteostat (magenta, protein aggregates), LAMP-2 (green, lysosomal membranes), and DAPI (blue, 
nuclei). Scale bars, 5 um. Basal, quiescence media. (B) Median Proteostat fluorescence normalized 
to the median value per replicate to combine samples prepared at different times. Mean + SEM of 
values from six biological replicates analyzed by flow cytometry on two different days. Each point 
represents an independent primary culture derived from two 3- to 4-month-old mice. (C) Percentage 
of cells that incorporated 5-ethynyl-2’-deoxyuridine (EdU) (cells in S phase of the cell cycle) during a 
3-hour pulse, as measured by intracellular flow cytometry. Mean + SEM of values from nine different 
biological replicates analyzed on four different days. Each point represents one independent primary 
culture derived from two 3- to 4-month-old mice. (D) Differences in Proteostat* area quantified from 
images. Mean + SEM of values from five independent biological replicates from two independent 
experiments. (E) Percentage of cells that incorporated EdU measured as in (C). Mean + SEM of 
values from nine different biological replicates analyzed on four different days. (F) Proteostat fluorescence 
measured as in (B). Mean + SEM of values from nine biological replicates analyzed on three different 
days. rtTA, reverse tetracycline-controlled transactivator. (G) Percentage of cells that incorporated EdU 
measured as in (C). Mean + SEM of values from six different biological replicates analyzed on two different 
days. For all panels, P values determined by one-sided Wilcoxon rank sum test. *P < 0.05; **P < 0.01. 
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Fig. 4. Old (NSCs show lysosome defects, increased protein aggregates, 
and decreased activation, but these can be counteracted by lysosomal 
activation. (A) Log2 fold changes between young (3- to 4-month-old) and 
old (19- to 22-month-old) samples of each cell type for all detected genes 
using DESeq2 (36). Each point represents one gene. Genes in color are 
significantly (“sig.”) up- or down-regulated with age with FDR = 0.05. Genes in 
gray are not significantly up- or down-regulated with age. (B) SVZ whole-mount 
staining of young (3-month-old) and old (21-month-old) qNSCs. Left: 

LAMP-1 (lysosomes, yellow) staining in NSCs identified by the combination of 
a KI67-negative cell soma and a GFAP* (red) projection through ependymal 
cell pinwheels at the ventricular surface (KI67, proliferation marker). Scale bar, 
2 um. Right: Quantification of LAMP-1 fluorescence intensity in GNSCs 
normalized for cell size from three young and three old mice (n = 65 qNSCs). 
(C) Median fluorescence of LAMP-1 normalized for cell size. Mean + SEM of 
values from three to four young and old mice, analyzed by flow cytometry on a 
single day. Each point represents FACS-sorted cells from a single mouse. 

(D) Representative histograms of the GFP-LC3 fluorescence per cell in (NSC 
and aNSC populations from a 5-month-old versus 25-month-old transgenic 
GFP-LC3 mouse, as measured by flow cytometry. See also fig. S9, C and D. 
(E) Median fluorescence of Proteostat in GFP-LC3-high qNSCs and aNSCs and 
NPCs from young (2- to 5-month-old) and old (25-month-old) GFP-LC3 mice 
normalized to the median value per replicate to combine samples prepared at 
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different times. Mean + SEM of values from three to four different mice 
analyzed by flow cytometry on a single day. Each point represents cells 
FACS-sorted from a single mouse. (F) Median fluorescence of Proteostat 
normalized to the median value per replicate. Mean + SEM of values from 
nine biological replicates analyzed on three different days. Each point 
represents one independent primary culture derived from a 17- to 
22-month-old mouse. (G) Percentage of cells in an individual culture that 
incorporated EdU (cells in S phase) during a 3-hour pulse, as measured by 
intracellular flow cytometry. Mean + SEM of values from six different biolog- 
ical replicates analyzed on two different days. Each point represents one 
independent primary culture derived from two 3-month-old or 20-month-old 
mice. (H) Percentage of cells that incorporated EdU measured as in (G). 
Mean + SEM of values from six different biological replicates analyzed on two 
different days. Each point represents one independent primary culture 
derived from a 17- to 22-month-old mouse. (1) Old mice (22 months old) 
were treated for 3 months with rapamycin in mouse chow. SVZ compo- 
sition was then analyzed by FACS. Each point represents the abundance of 
aNSCs and aNPCs (EGFR*CD24-CD31°CD45" cells) in a single mouse, 
expressed as the percentage of total live cells in the SVZ. Mean + SEM of values 
from six to seven different mice. P values [except panel (B)] determined 
by one-sided Wilcoxon rank sum test; Panel (B) P values determined by 
two-sided Wilcoxon rank sum test. *P < 0.05; **P < 0.01; ***P < 0.01. 
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subpopulation of old qNSCs that had increased 
amounts of GFP-LC3 fluorescence (Fig. 4D and 
fig. S9, C and D), indicative of reduced autophagic 
flux into acidified lysosomes (where GFP is nor- 
mally denatured) (25). There was heterogeneity 
within the qNSC population with age (Fig. 4D 
and fig. S9, C and D), indicating that a subset of 
old qNSCs have defects in their lysosomes. By 
contrast, proteasome activity was not significantly 
altered with age in qNSCs or aNSCs (fig. S9E). 

We tested how protein aggregates are af- 
fected by aging in qNSCs. The subpopulation of 
old qNSCs with high amounts of GFP-LC3 exhib- 
ited an increased amount of protein aggregates 
in vivo (Fig. 4E). Consistently, primary cultures 
of qNSCs from old mice also exhibited an in- 
crease in abundance of protein aggregates (fig. 
SOF). The age-dependent accumulation of GFP- 
LC3 and protein aggregates in vivo was reduced 
by subjecting mice to fasting for 48 hours (fig. 
S9G), indicating that these aggregates can be 
modulated by nutrient deprivation. The increased 
amount of protein aggregates in old qNSCs could 
also be reduced by expression of constitutively 
active TFEB (Fig. 4F). Thus, old qNSCs exhibit 
increased protein aggregation, which may result 
from age-dependent reduction in lysosome num- 
ber or function, and this can be counteracted by 
TFEB expression. 

The ability of qNSCs to activate declines with 
age (9-11). Consistently, old qNSCs in primary 
culture activated less efficiently than young 
qNSCs in response to growth factors (Fig. 4G). 
Interestingly, a pulse of expression of the active 
form of TFEB in old qNSCs restored activation 
in response to growth factors (Fig. 4H), suggest- 
ing that increased lysosomal function helps 
promote activation of old NSCs. Furthermore, 
systemic treatment of old mice with rapamycin, 
a mechanistic target of rapamycin (mTOR) in- 
hibitor that enhances the lysosome-autophagy 
pathway (26, 27), resulted in increased abundance 
of aNSCs in the SVZ (Fig. 41). Although rapamycin 
could also exert its effects indirectly by acting in 
cells other than NSCs and/or by regulating other 
pathways (26), these data are consistent with the 
possibility that increasing lysosomal function 
can clear protein aggregates and counteract the 
decline in activation of old qNSCs (fig. S9H). 

Our study identifies a previously uncharac- 
terized organelle state in qNSCs: an enlarged 
lysosome that contains a large amount of in- 
soluble protein aggregates. The clearance of gNSC 
aggregates by lysosomal activation may provide 
a burst of energy for activation or prevent the 
propagation of potentially toxic aggregates to 
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NSC progeny. Although changes in autophagy 
during aging have been observed in hemato- 
poietic and muscle stem cells (27-29), these 
studies did not examine the lysosomes and their 
protein aggregate content in these stem cells 
during aging. As lysosomes are central hubs for 
key cellular trafficking, signaling, and meta- 
bolic pathways, targeting of lysosome biogene- 
sis and/or activation to clear aggregates in qNSCs 
may provide broad strategies to allow activation 
of old mammalian cells in vivo. This possibility 
is intriguing in light of the benefits of dietary 
restriction or starvation followed by refeeding 
to regenerative stem cell pools (30, 32) and the 
effects of the lysosome-autophagy pathway on 
protein aggregates (32) and life span (33) in 
Caenorhabditis elegans. Identifying the con- 
served mechanisms by which protein aggregates 
are regulated in aging stem cells should help 
identify specific ways to restore function to old 
stem cells and tissues. 
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SINGLE-CELL ANALYSIS 


Mutation dynamics and fitness effects 
followed in single cells 


Lydia Robert,”?* Jean Ollion,' Jerome Robert,’ Xiaohu Song,” 
Ivan Matic,? Marina Elez’** 


Mutations have been investigated for more than a century but remain difficult to observe 
directly in single cells, which limits the characterization of their dynamics and fitness effects. 
By combining microfluidics, time-lapse imaging, and a fluorescent tag of the mismatch 
repair system in Escherichia coli, we visualized the emergence of mutations in single cells, 
revealing Poissonian dynamics. Concomitantly, we tracked the growth and life span of 
single cells, accumulating ~20,000 mutations genome-wide over hundreds of generations. 
This analysis revealed that 1% of mutations were lethal; nonlethal mutations displayed a 
heavy-tailed distribution of fitness effects and were dominated by quasi-neutral mutations 


with an average cost of 0.3%. Our approach has enabled the investigation of single-cell 
individuality in mutation rate, mutation fitness costs, and mutation interactions. 


he pace of evolution and possible trajecto- 
ries depend on the dynamics of mutation 
incidence and the effects of mutations on 
fitness. Mutation dynamics has never been 
analyzed directly. It is assumed to be Pois- 
sonian, but mutagenesis characterized by bursts, 
with transient mutation rate increases, has been 
demonstrated in several organisms (J-3). Muta- 
tion bursts may facilitate the acquisition of com- 
binations of mutations, which is critical to the 
evolution of complex traits (4). Effects of mu- 
tations are known to range from beneficial to 
deleterious or lethal, but estimating the entire 
distribution of fitness effects (DFE), central to 
evolutionary modeling, has proven challenging 
(5). The DFE for spontaneous mutations has been 


Fig. 1. Experimental setup. (A) The mother 
machine microfluidic chip, used for growing 
E. coli cells during uMA and MV experiments. 
uMA and MV experiments were performed 
independently, using phase-contrast micros- 
copy and epifluorescence microcopy, respec- 
tively. (B) Phase-contrast images of one 
microchannel over time in a wMA experiment. 
We developed software that measures the 
mother cell length (represented by yellow 
boxes) at each frame and computes its fitness 
at each generation (14). (©) The mother 
machine allowed us to keep the mother cell 
at each division, regardless of its fitness, thus 
blocking natural selection. (D) Tracking point 
mutation emergence in single cells with 
YFP-MutL in the mutH strain, where all repli- 
cation errors are converted into mutations. 
(E) Overlay of red (from the constitutive 
expression of tdCherry for automated cell 
segmentation) and yellow (YFP-MutL) fluores- 
cence images from an MV experiment. The inset 
shows a magnified image of a cell with a 
YFP-MutL focus (arrow). 
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estimated with the mutation accumulation (MA) 
approach (6-8), whereby independent lines of 
individuals are propagated in conditions that 
minimize the effects of natural selection and the 
evolution of their fitness is tracked. The preci- 
sion of such DFE estimations is limited because 
of the small number of lines that can be moni- 
tored, typically fewer than 100. In addition, only 
an upper boundary for the average mutational 
effect is obtained, or strong assumptions are made 
regarding the DFE shape (9, JO). Finally, in MA 
studies of microorganisms, the mutation sample 
is biased, as natural selection purges lethal and 
strongly deleterious mutations. 

We quantitatively characterized the dynamics 
of spontaneous point mutations arising from 


A 
‘ Outlet 
Wy Ss = 
“UMM a 
>1009 <n fr, 
000 microchay Ma, 
els 
B Time 


Time 
o_o 


Replication error YFP-MutL accumulation 


16 March 2018 


Mutation 


replication errors, a major source of muta- 
tions (1, 12), and their DFE in the bacterium 
Escherichia coli. For real-time detection of muta- 
tions in single living cells, we developed a mu- 
tation visualization (MV) experiment. We also 
designed a microfluidic MA (uMA) experiment, 
which allowed us to track fitness in the complete 
absence of natural selection in a high-throughput 
fashion during the accumulation of mutations. 
In uMA experiments, we grew cells in a “mother 
machine” microfluidic chip (13) (Fig. 1A) and im- 
aged them by phase-contrast microscopy to mon- 
itor their growth and survival at the single-cell 
level (J4) (Fig. 1B and movie S1). The mother 
machine contained a series of separate micro- 
channels where cells grew in a single row (Fig. 
1A). These microchannels were closed on one 
side and retained the “mother” cell abutting the 
dead end through consecutive divisions. There- 
fore, at each generation, one individual was kept 
independently of its fitness (Fig. 1C), thus elim- 
inating natural selection. Malthusian fitness, the 
exponential growth rate at the population level, 
can also be defined at the single-cell level as the 
cell exponential elongation rate. We imaged more 
than 1000 microchannels in parallel, in con- 
trolled and constant conditions, every 4 min 
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over the course of 3 days, which corresponded 
to ~200 generations. We obtained ~10° fitness 
measurements per experiment. 

In MV experiments, cells grown in the mother 
machine were imaged by fluorescence micros- 
copy (/4). We visualized mutations using a fusion 
of yellow fluorescent protein (YFP) with MutL 
mismatch repair protein, which forms fluores- 
cent foci at replication errors (15) (Fig. 1, D and 
E, and movie S82). To detect and track MutL foci, 
we developed an automated image analysis pro- 
cedure (J4). The dynamics of mutations origi- 
nating from replication errors depends on both 
error production and repair. We examined the 
dynamics of error production by eliminating er- 
ror repair through inactivation of the mismatch 
repair endonuclease MutH (/4). Full inactiva- 
tion was verified by control experiments (74). The 
YFP-MutL protein forms transient foci (15) (movie 
$2), and the average lifetime of the foci in the 
mutH strain corresponds to the time between 
the passage of two successive replication forks 
(14) (fig. S1). This pattern shows that in mutH 
cells, all MutL foci are converted into mutations 
by the next round of DNA replication. 

Replication errors might occur at a constant 
rate with Poissonian dynamics, or alternatively, 
bursts of errors might be produced from fluctua- 
tions in intracellular composition, such as transient 
nucleotide pool imbalances (/, 3). To investigate 
these possibilities, we performed MV experiments 
using mutH cells, tracking more than 3000 MutL 
foci in mother cells and deriving their interarrival 
times (the times between two successive focus 
occurrences) (Fig. 2A). These interarrival times 
were exponentially distributed and uncorrelated, 
which is characteristic of a Poissonian process (14) 
(Fig. 2, B and C, and fig. S2). In agreement with 
this finding, the number of MutL foci appearing 
between two frames followed a Poissonian dis- 
tribution (fig. $3). As our method is limited by 
optical resolution, we cannot detect bursts of rep- 
lication errors if they occur at a single replication 
fork on a short genomic region (typically less than 
80 kilobases). In contrast, we can detect transient 
hypermutator states if they affect several repli- 
cation forks, as would be expected for metabolic 
fluctuations, because different forks occupy dif- 
ferent positions in the cell. 

Our data indicate Poissonian dynamics rather 
than dynamics characterized by bursts, but they 
do not exclude some fluctuations in the repli- 
cation error rate. In particular, we expect the er- 
ror rate to depend on the number of replication 
forks, which changes during the cell cycle (74). 
We found a global proportionality between cell 
size, a proxy for the stage in the cell cycle, and 
the replication error rate across a range of cell 
sizes (fig. S4). Further analysis of the relation- 
ship between cell size and error rate revealed 
steplike variations reminiscent of variations in 
the number of replication forks (Fig. 2, D and E), 
with error rates varying by a factor of ~3 during 
a single cell cycle. 

When exposed to certain external stresses, 
bacteria trigger specific responses that transi- 
ently increase the mutation rate (J6-18). In our 
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Fig. 2. Replication error dynamics. (A) The variables analyzed in MV experiments were the inter- 
arrival times for YFP-MutL foci (8) and the number of new foci per mother cell. 8, and 6,4; indicate 
two successive interarrival times. (B) Histogram of focus interarrival times for a representative 
mutH experiment (n = 1873 foci). The red line and dot are predictions under the Poisson process 
hypothesis, which includes a correction accounting for discrete observation (14). PDF, probability 
density function. (©) Correlation between two successive focus interarrival times for the data 

in (B). Color indicates local density (inset). Red dots represent data binned according to 6,. 

(D) Theoretical number of forks as a function of time for a doubling time of 26 min. Chromosomes 
are depicted in green, and the red dots denote replication forks. (E) Rate of replication error 

as a function of cell size in single cells. Black dots, experimental data; red line, prediction based 
on the number of forks (14). (F) Error rate per fork as a function of cell size and growth rate 

for single cells. Data were binned according to either cell size or growth rate. Results are 


means, and errors bars show +2 SEM. 


MV experiments, a small fraction of cells expe- 
rienced endogenous stress, as demonstrated by 
reduced growth rate and/or abnormally large 
cell size. We investigated whether such subpop- 
ulations exhibit higher rates of replication error. 
To take into account the different replication 
fork numbers for different subpopulations, we 
estimated the number of forks in single cells 
and the mean error rate per fork in each sub- 
population (/4). Abnormally large cells and slow- 
ly growing cells did not present elevated error 
rates per fork (Fig. 2F and fig. S5), suggesting 
that stress-induced mutagenesis (SIM) acts main- 
ly at the level of error repair or that endoge- 
nous stress in a favorable environment causes 
limited SIM. 

Visualization of mutations and fitness mea- 
surements could in principle be combined into a 
single time-lapse microscopy experiment. How- 
ever, simply linking one mutation to one fitness 
effect is impossible for two reasons: First, there 
can be a substantial lag between the occurrence 
of a mutation and its effect on fitness. Second, a 
MutL focus indicates the insertion of an erro- 
neous base in one of two newly replicated DNA 
strands. Consequently, the progeny from a 
cell with a MutL focus will contain a mixture 
of mutant and nonmutant cells (74) (fig. $6). 
Therefore, we performed uMA and MV experi- 
ments separately, which allowed us to increase 
the throughput of p.MA experiments. 

Using MV, we estimated mutation rates for 
E. coli wild-type (WT) (~0.0022 mutation/hour) 
and mutH mutant (~0.32 mutation/hour) strains 
(14), which agree with estimations from whole- 
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genome sequencing of MA lines of WT and 
mismatch repair-deficient cells (17). We then 
performed a uMA experiment with WT cells, 
which accumulated an average of ~0.1 mutation 
per mother cell line over ~60 hours. As ex- 
pected, the growth rate distribution for WT cells 
was stable over time (Fig. 3A, left). In contrast, 
for mutH cells, which accumulated ~20 muta- 
tions per mother cell line over ~60 hours, the 
mean growth rate slightly decreased (Fig. 3A, 
middle), and some cells underwent a steplike 
decrease in growth rate (Fig. 3B, blue trajectory) 
or a total cessation of growth, indicating cell 
death (fig. S7). Surprisingly, the average de- 
crease in the growth rate of mutH cells during 
uMA (~0.1% per hour) was smaller than what 
would be expected from previous estimates of 
the average effect of mutations (7, 19, 20). We 
also performed MA experiments using the strain 
MFI (/4), which has a mutation rate ~20 times 
that of the mutH strain (14) (table S1) because 
of impaired proofreading (21). As expected, the 
decrease in fitness in MF1 cells was ~20 times 
that in mutH cells (Fig. 3A, right, and table S2). 

We then characterized the DFE by developing 
a nonparametric method, with no assumptions 
about DFE shape (/4). Using the probabilistic 
framework of MA studies, we derived all the 
moments of the DFE (/4). For nonlethal mu- 
tations, our estimate of mean fitness cost (Fig. 
3C) is 0.31% + 0.04%, which is lower than pre- 
vious estimates (7, 19, 20). We found that the 
DFE for nonlethal mutations has a coefficient of 
variation (CV) of 9.5 + 1.2, a skewness of 16.6 + 
0.7, and a kurtosis of 360 + 90 (Fig. 3C). We also 
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obtained moments of higher order for the DFE 
(table $3). These results indicate a distribution 
that is wide, skewed, and heavy tailed, domi- 
nated by quasi-neutral mutations relative to in- 
frequent, strongly deleterious mutations, such as 
the beta distribution in Fig. 3D. 

The estimated DFE for MF1 was similar to the 
DFE for the mutH strain (table S4), although 
these strains have different mutation spectra 
(11, 21). To further investigate the effect of the 
mutation spectrum on the DFE, we performed 
uMA experiments using a mutT mutant (fig. S8), 
which has a mutation rate similar to that of the 
mutH strain but a specific mutation spectrum 
of >99% AT-to-CG transversions (11, 12, 14) (table 
Sl). Despite the differences in the way mutagen- 
esis occurs, we found that the DFE for the mutT 
strain did not differ substantially from the DFE 
for the mutH or the MF! strain (table S4). 

Steplike growth rate decreases such as those 
shown in Fig. 3B (blue trajectory) occurred dur- 
ing uMA in all mutator strains and not in the 
WT strain. They might be provoked by a single 
deleterious mutation, independently of previous- 
ly acquired mutations, or by the accumulation of 
interacting mutations. To investigate this issue, 
we studied the rate of occurrence of growth rate 
decreases in mutH and mutT cells. From the anal- 
ysis of all single-cell trajectories (14) (fig. S9), we 
defined two categories of cells: slow-growing cells 
that underwent a >30% decrease in growth rate 
(Fig. 4A, top) and fast-growing cells correspond- 
ing to the rest of the population. The percentage 
of fast-growing cells decreased exponentially over 
time (14) (Fig. 4A, bottom), demonstrating that 
>30% growth rate decreases are caused by single 
deleterious mutations with >30% fitness cost. 
These mutations occur at a constant rate, inde- 
pendently of the accumulation of previous mu- 
tations, and represent 0.3% of all mutations. As 
expected from the quasi-optimal growth rate of 
our mutH strain, our analysis of single-cell tra- 
jectories detected no strongly beneficial muta- 
tions (>20% effect) during pMA (74). 

Likewise, to assess whether lethal mutations 
occur at a constant rate, we studied survival 
during nMA (14). The WT strain, accumulating 
~0.1 mutation over the course of uMA, showed 
age-related mortality (Fig. 4B, left), in agreement 
with previous observations (13). The high-mutation- 
rate mutH, mutT, and MF! strains exhibited ad- 
ditional mutation-related mortality (Fig. 4B, left). 
During the first ~35 hours of uMA, when the 
mortality rate for the WT was low and approx- 
imately constant, the survival of the mutH, mutT, 
and MFI mutator strains exponentially decreased 
(Fig. 4B, right, and fig. S10). To further disentangle 
age- and mutation-related mortality, we per- 
formed a uMA experiment using the strain pMQ, 
which has increased proofreading deficiency 
compared with MFI (27). pMQ cells died rapidly, 
and their survival decreased exponentially (Fig. 
4B, right, and fig. S11). These analyses demon- 
strate that lethal mutations occur at a constant 
rate in nMA, independently of previously ac- 
quired mutations, and that ~1% of mutations 
are lethal. 
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Fig. 3. p»MA analysis allows nonparametric DFE characterization. (A) Representative nMA results 
for WT, mutH, and MF1 (dnaQ926) strains. For each time point, all single-cell growth rates for all 
mother cells are plotted (WT strain, n = 1.8 x 10° cells; mutH strain, n = 1.1 x 10° cells; MF1 strain, 
n = 2.6 x 10* cells). Color depicts growth rate density. The red line represents the mean growth 
rate evolution. The black line indicates the mean of all growth rates. (B) Examples of single-cell 
growth rate trajectories for the mutH strain (one color per cell line). (©) Nonparametric estimation 
for the first four moments of the DFE allows calculation of the mean, the coefficient of variation 
(CV), skewness, and kurtosis (for each calculation, the mean + 2 SEM for three independent mutH 
experiments is given). (D) Example of distribution with similar moments [beta distribution; parameters, 
a = 0.0074 and B = 2.4, calculated to give the mean and CV as for the DFE in (C), giving skewness = 

17 and kurtosis = 370]. The inset shows the same distribution on a log scale. 
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Fig. 4. Strongly deleterious (>30% cost) and lethal mutations occur at a constant rate during 
uMA. (A) The top panel shows an example of a mutH growth rate trajectory (blue line) corresponding 
to the occurrence of a strongly deleterious mutation in a mother cell. To analyze such events, we defined 
two categories of mother cells, slow-growing cells that underwent an abrupt, >30% growth rate 
decrease and fast-growing cells corresponding to the rest of the population. The bottom panel shows 
the temporal evolution of the percentage of fast-growing cells (represented by blue dots, on a log 
scale; we analyzed 5272 single-cell trajectories and found 156 strongly deleterious mutations). 

The 95% confidence interval is narrower than the line thickness. The red curve is an exponential 

fit (coefficient of determination R* = 0.99). (B) The left panel shows survival curves of the population 
for the WT (n = 1283 single-cell trajectories), mutH (n = 1480), mutT (n = 216), MF1 (n = 838), 
and pMQ (n = 472) strains. The 95% confidence intervals are represented by the shaded areas 
(they are narrower than the line thickness for n > 400). The right panel shows the same survival 
curves on a log scale for mutH (data in blue, exponential fit in red; R® = 0.99), MF1 (data in cyan, 
exponential fit in red; R® = 0.99), and pMQ (data in red, exponential fit in blue; R® = 0.99) strains. 
The plot is limited to ~35 hours, during which age-related mortality was low and approximately 
constant. The 95% confidence intervals are represented by the shaded areas. 


3 of 4 


810Z ‘G} Yyouey UO /G10' Bewsouel0se0ue!S//:d}]y WOd| pepeojuMOGg 


RESEARCH | REPORT 


Our results show that the accumulation of mu- 
tations with <30% fitness cost does not change 
the rate of lethal or strongly deleterious muta- 
tions (>30% cost), as would be expected from the 
saturation or induction of a buffering mecha- 
nism (22, 23). We anticipate that examination of 
various suboptimal genotypes in uMA experi- 
ments would allow further investigation of 
mutation interactions and provide a complete 
picture of epistasis, central to the evolution of 
sexual reproduction and recombination (24). Our 
approach may also allow for investigation of how 
the DFE and mutation dynamics change in 
response to the environment, shedding light 
on the adaptation potential of populations. 
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Response to Comment on “Synthesis 
and characterization of the pentazolate 


anion cyclo-N5_ in (N5)6(H30)3(NH4)4Cl” 


Chao Jiang,’* Lei Zhang,”** Chengguo Sun,’* Chong Zhang,’ Chen Yang,! 


Jun Chen,”*+ Bingcheng Hu" 


Huang and Xu argue that the cyclo-Ns ion in (Ns)¢(H30)3(NH,4),CI we described in our 
report is theoretically unfavorable and is instead protonated. Their conclusion is invalid, as 
they use an improper method to assess the proton transfer in a solid crystal structure. 
We present an in-depth experimental and theoretical analysis of (Ns5)6(H30)3(NH,4)4Cl that 


supports the results in the original paper. 


n their comment, Huang and Xu (J) bring up 

the point that the ellipsoid of the O, atom in 

the asymmetry unit plot of the pentazolate 
salt was much larger than those of the re- 
maining atoms in (N5)6(H30)3(NH4)4Cl, and 
conclude that there should actually be less electron 
density than assigned in the model. They claim 
the O, atom could also be a nitrogen atom. How- 
ever, they ignore the anisotropic displacement 
parameters of the disordered atoms; thus, such 
a conclusion is meaningless. Huang and Xu also 
calculated separate ion pair N; --H,0* energet- 
ics for the solid crystal structure and drew the 
conclusion that HN; should exist in place of N; . 
In our paper (2), the (N5)¢(H30)3(NH,4)4Cl crys- 
tal containing the stabilized pentazolate anion 
was prepared by the rupture of the C-N bond in 
3,5-dimethyl-4-hydroxyphenylpentazole through 
treatment with m-chloroperbenzoic acid (7m-CPBA) 
and ferrous bisglycinate [Fe(Gly).]. Although 
the high-quality x-ray crystallographic data pro- 
vided the visualized proof of the structure of the 
pentazolate salt, confirmation of its structure 
and understanding of the cation-anion interac- 
tions have puzzled and intrigued researchers. 
Initial attempts via nuclear magnetic resonance 
(NMR), mass spectrometry (MS), and infrared 
(IR) spectroscopy to confirm the structure of 
(Ns)6(H30)3(NH4)4Cl were complicated. Fortu- 
nately, the expected single crystals were obtained 
by low-temperature evaporation of a solution 
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in ethanol and ethyl acetate. At the very early 
stage, HN; was considered the most likely struc- 
tural unit in pentazolate salt on the basis of NUR 
and MS spectra. However, the x-ray structure 
determination suffered from charge balance and 
the cyclo-N; showed high symmetry with five 


——» (Ns)o(H30)3(NHq)4Cl 


equal N-N bond lengths. We concluded that 
the cyclo-Ns should exist in its anionic form 
rather than its acid form, HN;. The non-hydrogen 
atoms were identified by their peak weights and 
sources from starting materials. Cl atoms gave 
the highest peak weights; N atoms gave the 
lowest. After all the non-hydrogen atoms were 
confirmed and refined, all the H atoms were 
smoothly located from Q maps from the high- 
quality crystal data. Each N atom was bonded 
to four H atoms, which gave an NH,” cation. 
Each O atom was coordinated by six atoms. After 
searching similar ions and considering electric 
neutrality, the O atom was assigned as H;0* 
with disordered H atoms with half occupancy. 
There were two oxygen atoms in the ellipsoid 
plot of (N5)6(H30)3(NH,),Cl, one a bit larger, the 
other cigar-shaped. The difference can be attri- 
buted to the disorder of the water molecules in 
the crystal structure, rather than to incorrect 
atom assignment (3). 

Shown in Fig. 1, A and B, are two “N NMR spec- 
tra of (Ns)6(H30)3(NH,)4Cl in which the essen- 
tial difference is the °N labeling in their starting 
material, 3,5-dimethyl-4-hydroxyphenylpentazole. 
The labeling effect with 'N isotope can be dis- 
cerned by analyzing the spectra (4). The un- 
labeled pentazolate salt shows a lone visible °N 
signal resonant at -356.18 ppm, whereas the 
labeled pentazolate salt has the same signal at 


+ NH,* 


-150 -200 


B N 


sf N 


-250 


15 
—» (Ns5)o(H30)3(NH4)4Cl 


-300 


-50 -100 -150 -200 -250 -300 -350 -400 -450 -500 
C  s000 
6000 
Fig. 1. Spectra. 4000 
15 io 
A) ~N NMR = 2000 
spectrum of Ni 
Ns)6(H30)3(NHq)4Cl. = 0 
B) °N NMR 5 
spectrum of = -2000 
Ns)6(H30)3(NHa4)aCl 
with labeled atom -4000 
at C-N bond position. 
C) EPR spectrum -6000 


of [Fe(Gly)2]/ 
m-CPBA/DMPO 
mixtures at -25°C. 
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-358.61 ppm. Both signals are reasonably assigned 
to NH," (5). The chemical shift difference of 
~2.4 ppm comes from the '°N labeling of the 
NH,” cation, which indicates that the NH,” 
cation is derived from the N atom bonded to 
the aryl ring in aryl pentazole through oxidative 
decomposition. 

As far as the source of Cl is concerned, the 
only chlorine source is m-chloroperbenzoic acid. 
We concluded that the Cl ion must come from 
the dechlorination reaction of m-CPBA (6). Pre- 
vious studies of iron-catalyzed oxidation reac- 
tions (7) indicate that the m-CPBA might react 
with ferrous glycinate and initiate a free radical 
process responsible for the cleavage of the C-N 
bond and the generation of Cl ion. The isolation 
of 2,6-dimethyl]-1,4-benzoquinone from the reac- 
tion mixture supports the C-N bond cleavage 
pathway. In order to explore this free radical- 
based hypothesis, we recorded low-temperature 
electron paramagnetic resonance (EPR) spectra 
(Fig. 1C), which showed a strong EPR signal 
when mixing [Fe(Gly).], m-CPBA, and DMPO 
(dimethyl pyridine N-oxide). In summary, ex- 
cess m-CPBA plays a dual role as the oxidant to 
initiate a free radical pathway combining with 
[Fe(Gly).] and the source of chloride. 

To evaluate the thermodynamic stability of 
(N5)6(H30)3(NH4)4Cl, we used density function- 
al theory calculations to quantify the electro- 
static interactions among the constituent ions 
in pentazolate salt (8). Taking the lattice param- 
eters and atomic coordinates from single-crystal 
x-ray diffraction analysis as input, the geometry 
optimization was performed on the basis of the 
conjugate gradient method. One approxima- 
tion for the hydronium ions is that the hydrogen 
atoms are fixed at three occupancy sites, whereas 
they practically occupy six resonance states at 
50% probability level. The simulated structure 
was considered as finally optimized when the 
residual forces were less than 0.03 eV/A and 
the stress components were less than 0.01 GPa. 
The simulated structure (Fig. 2A) shows satis- 
factory agreement with the characterized struc- 
ture by x-ray diffraction, with the discrepancy of 
the volume at 0.4% and the discrepancy of lat- 
tice vectors at ~1%. At the equilibrium state of 
the crystal, each chloride ion is stabilized by four 
ammonium ions. In Fig. 2B, the four ammonium 
ions around the chloride ion form a tetrahedron, 
resulting in four N-H---Cl hydrogen bonds (2.44 A 
and 8.02 kcal/mol). All the residual hydrogen 
atoms affiliated with ammonium ions directly 
contact the pentazolate anion, and the N---H-N 
hydrogen bond is 2.10 A and 12.06 kcal/mol. All 
the hydrogen atoms affiliated with hydronium 
ions interact with their neighboring pentazolate 
anion, resulting in two types of N---H-O hydrogen 
bonds, the first being 2.17 A and 13.26 kcal/mol 
and the second being 2.26 A and 11.81 kcal/mol. 
With the aid of the electrostatic interactions 
among the ions, the entire pentazolate crystal 
is thermodynamically stable. Binding energy is 
defined as the difference between total energy 
of constituent ions in the free state and total 
energy of the crystal. The binding energy of 
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(Ns)6(H30)3(NH4)4Cl crystal is calculated to be 
1000 kcal/mol, much higher than the binding 
energy of NaCl (187.9 kcal/mol) (9). 

To better understand the basicity of cyclo-N;_ 
ion, a known amount of pentazolate salt can be 
dissolved in deionized water at ambient tem- 
perature, enabling the pH of the solution to be 
accurately and easily determined by pH meter. 
When 50 mg of (Ns5)6(H30)3(NH,),Cl was dissolved 
in 5 ml of deionized water (1.71 x 10°” mol/liter), 
the pH of the solution was measured to be ~5.55 at 
ambient temperature (22.3°C). In contrast, under 
the identical concentration of 1.71 x 10°? mol/liter, 
the pH of aqueous ammonium chloride is 5.51 
(10), hence the solution of (Ns)6(H30)3(NH,4)4Cl 
is mildly acidic. The measured pH values for 
different concentrations of (N;)6(H30)3(NH4)4Cl 
solutions are listed in Table 1. The results lead us 


Se 
Table 1. The pH value of pentazolate 
salt solutions. Specific amounts of 
(Ns)6(H30)3(NH4)4Cl (from 10 to 50 mg) 
were dissolved in 5 ml of deionized 
water at 22.3°C. The obtained solutions 
were analyzed using a pH meter with a 
pH probe. 
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Fig. 2. Calculations. (A) Calculated lattice 
vector lengths (A), angles (°), and volumes 

(A®) of the primitive cell of pentazolate salt, along 
with results from experiments. (B) Hydrogen 
bonds stabilizing (Ns5)6(H30)3(NH4)4Cl at 
equilibrium state. Crystal orbital Hamilton 
population (COHP), the result of multiplying 
the electron density of states by the overlap 
element of the Hamiltonian, shows the 
contribution of a specific contact to band energy. 
The integrated value of COHP at band energy 
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to conclude that the correctly calculated equilib- 
rium constant of cyclo-N; ion is pKp* = 8.425 + 
0.002, indicating that it is more basic than ace- 
tate ion (pK,“° = 9.24) (10). 

In general, NH; has a stronger affinity for 
the proton than for H.O (11). If the HN; in- 
stead of cyclo-N; were present in the crystal of 
(Ns)6(H30)3(NH,4)4Cl, the cyclo-N; ion should 
accept protons from H;0* rather than NH,*. On 
the basis of this hypothesis, we constructed a 
crystal structure model (Fig. 2C) in accordance 
with (Ns)6(H30)3(NH,)4Cl except that one pro- 
ton had shifted from the hydronium ion to the 
cyclo-N; ion (with the H-N bond length set at 
0.95 A). Subsequently, the geometry optimiza- 
tion of the created structure was performed on 
the basis of the conjugate gradient method. One 
unanticipated finding was that the proton on the 
cyclo-N; ion quickly transferred to the nearest 
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H,O molecule to form H;0°. The total energy 
was lowered by 2.74 kcal/mol, implying that the 
cyclo-N; ion is thermodynamically more stable 
than HN; in the crystal structure. The highly 
symmetrical forces in the crystal also support the 
cyclo-N; ion in place of HNs. 
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Comment on “Synthesis and 
characterization of the pentazolate 
anion cyclo-N5 in (N5)6(H30)3(NH4)4Cl” 


Rong-Yi Huang* and Heng Xu 


Zhang et al. (Reports, 27 January 2017, p. 374) reported synthesis of a cyclo-Ns5 ion 
putatively stabilized in a solid-state salt by hydrogen bonding from surrounding 
counterions. We performed theoretical calculations suggesting that HNs would be 
favored over the anion in the reported pentazolate salt via proton transfer. 


hang et al. (1) reported a cyclo-N; ion 
stabilized in the (Ns)6(H30)3(NH,4)4Cl salt 
via hydrogen bonding from the adjacent 
NH,” and H;0* ions. The structure was 
determined by single-crystal x-ray diffrac- 
tion (XRD) analysis. However, the ellipsoid of 
the O, atom in the asymmetry unit plot of the 
pentazolate salt was much larger than those of 
the remaining atoms in the reported pentazolate 
salt, indicating that there should actually be less 
electron density than assigned in the model. The 
O. atom could also be a nitrogen atom. There- 
fore, we suggest that the authors should provide 
solid evidence to confirm the O2 atom in the 
pentazolate salt. Moreover, the phase purity of 
the pentazolate salt and the stability of its crystal 
structure at room temperature could not be de- 
termined on the basis of the physical evidence 
afforded at present. In particular, the cyclo-N; 
ion in (N5)6(H30)3(NH4)4Cl possesses C2, sym- 
metry, and thus the analysis of the infrared 
spectrum of the cyclo-N; ion based on D;, 
symmetry is questionable. In addition, the out- 
of-plane mode peak of the H30* ion cannot be 
observed from the infrared spectrum of the 
pentazolate salt. On the basis of the nature of 
the cyclo-N; ion and H,O, the two expected 
competitive reaction pathways are 


HN;°:‘H,O<[N5°*-H-:-H,O]>N;7 ---H30* 


Thus, in the reported pentazolate salt, the ques- 
tion becomes whether the purported cyclo-N; is 
actually protonated. Recently, Steele et al. (2) 
reported that the crystal of pentazole HN; dis- 
covered using first-principles evolutionary search 
at high pressures is thermodynamically stable, 
so HN; may also exist in the pentazolate salt. 
Because the position of the H atom is difficult 
to confirm by single-crystal XRD methods, we per- 
formed theoretical calculations for all the relevant 
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species as an aid to ascertaining the H atom 
position in the reported pentazolate salt. 

All computations were performed with the 
Gaussian 09 D.01 program (3). Density functional 
theory calculations were carried out using the 


B3LYP (4, 5), wB97X-D (6), and MO6-2X (7) 
methods in combination with the 6-311++G(d,p) 
(8) basis set and the very large aug-cc-pVQZ 
(9, 10) basis set for H, O, and N atoms, re- 
spectively. For all the species explored here, 
the initial geometries were further optimized 
and were followed via frequency analysis to 
confirm minimum energy structures. The vibra- 
tionally stable cyclo-N; ion possesses D;;, sym- 
metry at different levels, in distinct contrast to 
that of the cyclo-N; ion (C,, symmetry) in the 
reported pentazolate salt. However, the exper- 
imental C,,, symmetry of the cyclo-N; ion co- 
incides with that of HN; (C2, symmetry) obtained 
at different levels. This indicates the possible 
formation of HN; during the reaction process. 

To illustrate that HN; can exist in the reported 
pentazolate salt, we obtained the molecular elec- 
trostatic potentials V(r) and average local ion- 
ization energies /(r) on the molecular surfaces 
encompassing cyclo-N; and H,O by means 
of the Multiwfn program (17). Figure 1A pre- 
sents the distribution of the local minima values, 
Vg min(1) and Ismin), on the van der Waals sur- 
face of cyclo-N; and H2O, which are the elec- 


trophilic reactive positions. Vg min(r) and Tsmin®) 
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Fig. 1. Computational results. (A) Distribution of electrostatic potential minima (left) and average 
local ionization energy minima (right) on the van der Waals surfaces of cyclo-Ns (top) and H20 
(bottom) at the B3LYP/6-311++G(d,p) level. (B) The calculated AVs min(1) [Vs min(r) of H20 — Vs min(r) 
of cyclo-Ns_] (red) and Alg min() [s,min(r) of H20 — Is min(r) of cyclo-Ns J (blue) at different levels. 


Table 1. Topological parameters at bond critical points of N-H and O-H bonds for HN; and H30* 
by atoms-in-molecules analysis based on the aug-cc-pVQZ basis set. Values are atomic units. 
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Fig. 2. Geometry optimization. (A) lon pair Ns ---H30* obtained from the crystal structure of 
the pentazolate salt. (B) A barrierless proton transfer during optimization leads to the final optimized 


structure at the wB97X-D/aug-cc-pVQZ level. 


are all associated with the N/O atoms in cyclo-N5~ 
and H,0, except for the partial Vgmin(r) of cyclo-N;- 
located between the two adjacent nitrogen atoms. 
In comprehensively considering the distribution 
of Vgmin(r) and Tsmin(), the strong reactive sites 
are those associated with the N/O atoms in cyclo- 
N; and H,0, respectively, which can be attacked 
by the electrophilic reagents (e.g., M"* and H*). 

As is well known, the addition product, H,0*, 
is stable in corresponding systems. Zhang et al. 
(12) reported a cobalt-based metal complex con- 
taining the cyclo-N;" ligand, in which the Co?* 
and N; ions are linked together though a Co-N 
bond. Burke and colleagues (13), using N nu- 
clear magnetic resonance spectroscopy, con- 
cluded that HN; was also produced and held in 
solution as a Ns ion with Zn". Steele et al. (2) 
reported a thermodynamically stable crystal of 
pentazole HN; discovered at pressures above 
50 GPa using first-principles methods. All of these 
results, obtained experimentally and theoretically, 
indicate that our predictive electrophilic reac- 
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tivities of cyclo-N; and HO are reasonable. 
Further, in looking at the Vg min(r) and Tsmin®) 
values associated with the N/O atoms for cyclo-N; 
and H,0, it is notable that the Vemin(t) and Ig min(t) 
values of cyclo-N; were much lower than those 
found for H,O at different levels. The computed 
AVsmin(t) Vsmin(t) of H20 - Vsmin(t) of cyclo-N5- J 
and Algmin®) Ugmin(t) Of H20 — Igmin(t) of cyclo-N5 J 
were also obtained. As shown in Fig. 1B, the 
AVgmint) and Algmin(t) between HO and cyclo-N;" 
were more than 84 and 122 kcal mol}, respec- 
tively. The large difference values of Vg min(r) 
and Tg min(X) indicate that the electrophilic re- 
activity of cyclo-N;- was much higher than 
that found for H,O, indicating that HN; was 
more favored in the reaction mixture of H*, 
cyclo-N; , and H,O. The topological parameters 
at the bond critical points (BCPs) of the N-H 
and O-H bonds for HN; and H,0* at different 
levels are listed in Table 1. The electronic den- 
sity at the BCPs demonstrates that the N-H bond 
of HN; was also stronger than the O-H bond 
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of H,0*. Thus, HN; can exist in the reported 
pentazolate salt. 

Deeper insight into possible proton transfer 
(PT) from H,0* to cyclo-N; emerged from theo- 
retical calculations performed for the ion pair 
N; --H30* (Fig. 2A), which were taken from the 
crystal structure of the reported pentazolate salt. 
The optimized stable structure of the ion pair 
N; --H30* was also obtained, as shown in Fig. 2. 
We obtained new species in which the proton was 
bonded to the nitrogen atom of cyclo-N; , namely 
the hydrogen-bonding dimer HN;;--H.O (Fig. 2B). 
In addition, the figure presents a continuous 
decrease in the energy for the species studied, 
which occurs as a result of the PT from H30* to 
cyclo-N; and the formation of O---H-N hydrogen 
bonds and N-H bonds with cyclo-N; . The above 
analysis shows a barrierless PT mechanism dur- 
ing optimization. Furthermore, the calculations 
at the wB97X-D/aug-cc-pVQZ level show that 
the deprotonation of H,0* requires an energy of 
166.62 kcal mol”!, whereas the protonation of 
cyclo-N; can provide an energy of 319.77 kcal mol. 
As a result, besides enough energy for the PT 
process, there can be extra energy for release, so 
it is easy for PT to occur in the ion pair studied. 
The above analysis further indicates that the HN; 
can exist in the reported pentazolate salt via PT. 
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Waseda University pushes forward 
with global academic network 


WASEDA __ Innovative programs and prioritized funding have propelled Waseda University to record highs in world 
University university rankings, underscoring the university's reputation for openness, dynamism, and diversity. 


London, Technical University Darmstadt, 
he University of Pisa, Rice University, 

he University of Pittsburgh, and the 
University of Hamburg. “Students must 
spend at least three months at one of 
our partner institutes,” explains Shiba- 


: _ a. “Beginning in 2018, we will offer co- 
M1 €E ome people may wonder about the importance of mathematics in . i : . ee 

cotkgy ie : F . Yoshihiro i: #% eaching courses with the University of 

society,’ says Yoshihiro Shibata, head of the Multiscale Analysis, Shibata # Pisa and intend to expand our network 


Modelling ance Unit. mCwENEE L is ak to es test Gf ineluidecivenieates tennchinae 
COMIpIEN ne eesalt dae meng ST PelMen is sone Ue @ Specific research topics include mathematical modelling based on 
investigating the flow of blood in arteries. The development of science : ‘narthel . Fal fewith ee 

d technology often requires a cycle of mathematical analysis, computer Hels Natalee Solel ouch mtlnh cOuae velista ie nahn 
pes oY q a Yelm P decommissioning of nuclear reactors destroyed in the Fukushima accident; 


simulations, and experimental verification.” : : : 
‘ P analyzing cloud formations to better understand climate change; and 


nie ai members collaborate with international partners to apply : observing blood flow for early diagnosis of cardiovascular issues such as 
nonlinear partial differential equations, geometry, and quantum mechanics aneurysms 


to practical applications such as fluidics, or the flow of gas and liquids. As part "The emphasis of our education and research is to combine expertise in 


of their activities, Shibata and his colleagues also started a doctoral course at ; ; , 
i ; es : ; : mathematics and physics to produce useful models to solve major prob- 
the Faculty of Science and Engineering in April 2017, with lectures in quan- | ost ae i 
ems facing mankind. 


tum mechanics, quantum mathematics, fluid mathematics, numerical fluid 
dynamics, experimental hydrodynamics, and geometry. Students participate 


The role of mathematics in science 
and technology 

—WMultiscale Analysis, Modelling 

and Simulation 


Multiscale Analysis, Modelling and Simulation Unit 


in international symposia as well. www.waseda,jp/inst/sgu/en/unit/multiscale-analysis-modeling- 
The course is run in collaboration with researchers from Imperial College and-simulation 
Integrating methodology of political Advanced Study could complete their studies entirely in English. “We are 
science and economics for innovative hiring international faculty members to teach and conduct research at 
Waseda,’ explains Tanaka. “It's a stringent selection process, assessing both 
models solving global issues publications and teaching ability. We hired six international faculty in 2016, 
—Positive/Empirical Analysis of Political and are interviewing mare for the coming year’ 
Tanaka and colleagues are also collaborating with Stanford University and 
Economy the University of Essex to organize summer schools at the Waseda campus 
in Tokyo. Plans include expanding courses to contain wider global perspec- 


Ml [ t is almost impossible to separate 
economics and politics on the 

global stage,’ says Aiji Tanaka, head 
of the Positive/Empirical Analysis 
of Political Economy. ‘Raising taxes is 
both a political and economic decision. 
So is importing rice or fruit products. 
But universities in Japan teach politics 
and economics as separate subjects. 
Waseda University has combined these 
subjects in the Faculty of Political Science and Economics.” 

Tanaka is initiating further reforms to create an international hub 
for education and research. Notably, students at the Faculty of Political Positive/Empirical Analysis of Political Economy Unit 
Science and Economics, Faculty of Commerce, and Waseda Institute for www.waseda.jp/inst/sgu/en/unit/empirical-analyses-of-political-economy 


tives, hiring more international staff, and collaborating on positive theory/ 
empirical analyses of the behavior of political/economic actors and the 
governance of political systems and multinational corporations with the 
Said Business School at Oxford University, the School for Advanced Stud- 
ies in the Social Sciences, the University of California, San Diego, and other 
universities abroad. 

“Waseda University has a 130-year history of pioneering political 
economics—based education and research,’ says Tanaka. “Our faculty 
members are training our students to be fluent in the languages of both 
economics and politics, and we will continue this tradition on a global 
scale to resolve the challenges of the 21st century: 


Look out for more news from Waseda University in the March 23 issue of Science. www.sciencemag.org/advertorials 
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Advances in biosensors and microfluidic devices are driving 
a quiet revolution in biomedical research, which could lead 
to the reduction or elimination of animal use in many experi- 
ments. By Alan Dove 


or decades, laboratory biologists have regarded 

animal models as a necessary evil. While some 

activists decry their use on moral grounds, even the 

most practical-minded researchers acknowledge 

fundamental problems with them. Animals are 
expensive, provide only imperfect replicas of human biology, 
and introduce numerous variables into experiments that can be 
difficult or impossible to control. 

These flaws aren't purely academic. Pharmaceutical researchers 
have struggled for years with late-stage development failures, in 
which drugs that look promising in multiple animal systems turn 
out to be useless or even toxic in humans. Nonhuman models 
have simply been the least bad tool for detailed studies on hu- 
man biology. 

Thanks to advances in completely unrelated fields, though, that 
grim situation is starting to change. Using improved biosensors 
that can monitor microscopic compartments, and microfluidic de- 
vices that combine the miniaturized features of computing chips 
with living cells and tissues, researchers are now building systems 
that can reduce or even eliminate the need for laboratory animals 
while simultaneously yielding better data. 


Upcoming features 


Diamonds are for sensors 

Pharmacokinetics researchers have been among the most 
enthusiastic proponents of improving on animal models. Ideally, 
researchers would like to know exactly where a drug goes in the 
body and how and when it’s processed, generating a detailed 
history from dosing to metabolism to excretion. In practice, 
that has entailed cumbersome techniques such as medicating 
numerous animals and sacrificing them for analysis at different 
times. Besides being laborious and expensive, these experiments 
give only coarse measures of drug metabolism across time and 
body compartments. 

In recent years, investigators have implanted tiny, electro- 
chemical carbon-based sensors into animals, which provide mea- 
surements of metabolic changes in real time in a single animal. 
That's worked for natural signals such as dopamine and serotonin 
levels in the brain, but the high background noise and low dy- 
namic range of carbon probes make them poor choices for study- 
ing drug metabolism. 

Chemists and biologists are collaborating to overcome those 
limitations. Yasuaki Einaga, professor of chemistry at Keio 
University in Yokohama, Japan, has worked on electrochemical 
sensors for decades. Einaga’s group has found that boron- 
doped diamond probes are particularly good at detecting 
electrochemical changes in solutions. His researchers have tested 
these probes in systems ranging from wastewater treatment to 
chemical synthesis. 

“To further explore their biological applications, in 2007 [we] 
succeeded in downsizing the ... probes to micro scale,” says 
Einaga. Hiroshi Hibino, professor of molecular physiology at 
Niigata University School of Medicine in Niigata, Japan, saw the 
system’s potential for pharmacokinetics applications, and began 
collaborating with Einaga in 2011. 

The two labs have since found that boron-doped diamond 
probes can detect electrochemical changes caused by several 
classes of drugs in live animals and explanted organs, cont.> 


Microscopy: Live Cell Imaging—March 30 ® Molecular Biology: PCR Anniversary—May 11 m Proteomics: Big Data Sharing—June 15 
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providing real-time data on drug concentrations. "The number 
of drugs measurable by these sensors is much more than that of 
any conventional electrode,” says Einaga. In one recent paper, 
for example, the team accurately measured concentrations of 

a diuretic, an anticonvulsant, and a chemotherapeutic agent in 
guinea pigs’ inner ears. 

“[We] have a plan to test the 
kidney, [and] both groups are 
discussing a further collabora- 
tion to develop an implantable 
microsensing system to track [a 
drug] and its effects longitudi- 
nally in organs such as the brain,” 
says Hibino. He adds that the two 
labs are now in discussions with 
several other researchers who 
want to use the technology in 
fields ranging from oncology to 
psychopharmacology. 


Plumbing codes 

While improved biosensors 
can extract more and better data 
from each animal, they can also 
be combined with microfluidic 
devices to replace an animal 
outright, at least for some types 
of experiments. Microfluidic de- 
vice makers borrow techniques 
from the electronics industry to 
manufacture miniature labora- 
tories on chip-like wafers. The 
small size of the chips’ channels 
and chambers means that flu- 
ids can flow through them rapidly. Microfluidic devices can also 
include complex structures that mimic biological compartments, 
making cultured cells behave more naturally. Finally, the chips can 
be mass-produced on semiconductor manufacturing equipment, 
making them relatively cheap. 

Over the past few years, biologists have built a series of pro- 
gressively more complex microfluidic devices, which have essen- 
tially evolved into artificial, miniaturized human organs. Biosen- 
sors built into these systems allow researchers to watch in real 
time as, for example, a tiny human-like liver or kidney reacts to an 
incoming dose of a drug. 

The power of these new systems can be intimidating, though. 
“The biggest challenge for us instrument providers is to educate 
people,” says Fabien Crespo, head of marketing and sales at Elve- 
flow in Paris, France. Crespo adds that “people are kind of afraid of 
microfluidics, which is a lot about plumbing.” 

Elveflow and a few other companies have made this micro- 
scopic “plumbing” their main focus. “Researchers can now find 
microchips suited to their applications, and we are providing 
everything to control liquid flow inside those microchips,” says 
Crespo. Because the microfluidics field has standardized the 
different types of fittings that go into and out of chips, a single 
liquid-handling system can adapt to changing laboratory needs. In 
Elveflow's setup, for example, a point-and-click computer interface 
allows researchers to build multistep protocols controlling liquid 


Lung-on-chips in a multiwell plate format with ultrathin, porous, 
and elastic membranes, on which cultured lung cells can be 
cyclically stretched to mimic breathing motions. 


“We want to be able to take cells from a 


patient and test them ... and see which 
therapy is going to be best for that patient.” 
— Olivier Guenat, CEO of Alveolix. 


Produced by the Science/AAAS Custom Publishing Office 


flows. More advanced users can use a scripting interface to drive 
the system programmatically. 

Such flexible, modular systems are undoubtedly one reason 
microfluidics are becoming so popular. “We are seeing a lot of 
expansion in the field, especially over the last two years,” says 
Crespo. He adds that while the first users of microfluidics were 
mostly in academic research 
laboratories, he’s seen increasing 
demand from industrial 
scientists. That's likely driven 
by interest in developing new 
preclinical drug screening 
assays, but Crespo also expects 
microfluidics to start showing up 
in point-of-care diagnostic tests 
within the next few years. 


The government chips in 

The groundwork for the new 
boom in microfluidics began 
nearly a decade ago, through a 
combination of basic research 
and farsighted government as- 
sistance. When researchers in a 
few academic laboratories be- 
gan building “organ-on-a-chip” 
systems, administrators at the 
U.S. National Institutes of Health 
(NIH), the U.S. Food and Drug 
Administration, and the Defense 
Advanced Research Projects 
Agency took notice. “It became 
apparent ... that this was going to 
be a promising tool and technol- 
ogy,” says Danilo Tagle, associate director for special initiatives in 
the Office of the Director at NIH in Bethesda, Maryland. 

Tagle and his counterparts at other agencies convened a meet- 
ing with researchers in 2011 to discuss turning the new tissue and 
organ chips into practical models for drug testing and regulatory 
approval. “Numerous studies in the past few years have indi- 
cated that as much as 90% of the attrition in drug development 
is caused by failure to predict safety as well as efficacy when us- 
ing [cell] culture systems and in vivo animal models,” says Tagle. 
“What we're hoping [is that] these tissues on chips or organs on 
chips can fill in the missing information we need in order to have 
better success in drug development.” 

To help make that happen, NIH created the Tissue Chip for 
Drug Screening program in 2012. Collaborating with several aca- 
demic labs, the initiative’s five-year goal was to build organs on 
chips that could yield accurate predictions about drug responses 
in humans. Researchers funded by the program had to build 
devices that could keep cells alive in a setting that would mimic 
specific organs or tissues, and incorporate biosensors to measure 
the cells’ physiology. The program has now established several 
independent testing centers for organ chips, using over 100 
drugs that cleared traditional preclinical testing only to fail in clini- 
cal trials. "We're asking the question, ‘Would [a given chip] have 
predicted the adverse event that the 2D culture systems and the 
animal models were unable to predict?” says Tagle. Organ chips 
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www. hesperosinc.com 


that do well in these tests could be used to supplement or even 
replace animal data in future regulatory filings by drug makers. 
Having concluded its first phase in July, with several promis- 
ing systems being tested for assaying drug toxicity, the Tissue 
Chip program is now focusing on models for drug efficacy. These 
single- and multiorgan systems will be built to mimic particular 
diseases, including Parkinson's disease, amyotrophic lateral sclero- 
sis, and osteoarthritis. 


Deep breaths 

The main advantage organ chips have over traditional cell 
culture systems is their ability to simulate the complex structures 
and dynamics of intact organs. Instead of being restricted to flat, 
solid surfaces, chip makers can incorporate microscopic channels, 
curves, pores, and layers for the cells to populate. The carefully 
controlled, programmable fluid flows of a microfluidic device add 
to the realism of the technology, recreating the forces those same 
cell types would encounter in living humans. 

Lungs provide a good example of both the challenges and the 
potential of this approach. In the human lung, cells maintain a 
semipermeable barrier with two distinct sides, allowing gas ex- 
change between air and blood, while keeping the two fluids sepa- 
rate and withstanding regular cycles of flexing with each breath. 
Researchers at Harvard University’s Wyss Institute first mimicked 
this system with a lung chip in 2010. Emulate in Boston, Massachu- 
setts, is continuing development on that and other organ chips. 

The Boston team isn’t alone, though. “We saw this very interest- 
ing paper from the Wyss Institute, [and] we wanted to go further 
[with the concept],” says Olivier Guenat, CEO of AlveoliX in Bern, 
Switzerland. Guenat is also a group head at the University of Bern’s 
ARTORG Center for Biomedical Engineering Research, where 
his lab collaborates with clinical teams treating patients with lung 
diseases. 

To simulate the structure and three-dimensional deformation of 
a lung, Guenat’s team developed a platform that seeds cells onto 
thin layers of silicone containing regularly spaced holes. Only 3 
micrometers thick, the layers are nonetheless strong enough to 
withstand repeated, breath-like flexing. Having accomplished that, 
“the second biggest challenge was that we wanted ... to develop 
something that is very easy to use,” says Guenat. 

After several design iterations, AlveoliX now has a prototype 
system that maintains 12 lung chips on a standard-size multiwell 
plate. This arrangement enables users to handle and test the 
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chips with existing microscopes, plate readers, and other common 
laboratory equipment. Ultimately, “we want to be able to take cells 
from a patient and test them ... and see which therapy is going to 
be best for that patient,” says Guenat. 

The chips should also prove useful for preclinical studies, al- 
lowing scientists to control fluid flows, mechanical stresses, and 
other parameters with far greater precision than they can in living 
models, while simultaneously eliminating the challenges of animal 
handling. “I don't know any biologist who likes to sacrifice animals, 
and with organs on chips [now available], we really want to reduce 
animal testing,” says Guenat. 


Concerto for organs 

As organ chips become more established, scientists in the field 
are already pushing microfluidics and biosensor technology to the 
next logical step: multiorgan systems. In theory, one could simply 
pump culture medium through different chips in series, circulating 
drugs and metabolites within a high-tech homunculus. The reality 
turns out to be considerably more challenging. 

“There's just a lot of things that can go wrong when somebody 
tries to do this,” says Mike Shuler, president and CEO of Hesperos 
in Orlando, Florida. The company specializes in multiorgan chip 
systems. Shuler, who is also a professor of engineering at Cornell 
University in Ithaca, New York, says even people intimately familiar 
with multiorgan chip systems can have trouble getting them run- 
ning. “When I'm transferring technology from my [academic] lab 
down to the company, sometimes it takes a few iterations for us to 
get it to work right,” he says. 

As a result, Hesperos has built its business on offering multi- 
organ models as a service, rather than trying to sell and support 
them as stand-alone products. To date, the company has collabo- 
rated with several pharmaceutical companies interested in testing 
drug leads on chips. Shuler says Hesperos typically builds systems 
with four or five interconnected organs. “The liver is almost always 
the critical one, then cardiac and neuromuscular junctions ... are 
probably the [next] most popular,” he says, adding that the com- 
pany has also built systems incorporating artificial skin, gastroin- 
testinal tracts, and blood-brain barriers. 

One of the biggest challenges has been keeping the systems 
running long enough for extended metabolic testing. “We try to 
operate out to 28 days,” says Shuler. At that timescale, the differ- 
ence in solubility between oxygen and carbon dioxide in the cells’ 
medium can cause gas bubbles to accumulate, disrupting the 
system's tightly controlled fluid flows. Hesperos has addressed 
that problem by eliminating the pumps normally used to control 
microfluidic devices, and using a carefully designed gravity flow 
system instead. 

As the technology continues to develop, proponents of micro- 
fluidic systems expect their popularity to skyrocket in the next few 
years. "We're already being put into workflows for large and small 
pharma, [and] at this point we've been able to meet just about 
every single milestone that people have set out for us,” says James 
Hickman, Hesperos’ chief scientific officer. Shuler projects that hu- 
man organ chips “eventually will, | think, replace animals [or] so 
greatly reduce animal use that it’s a much less significant part of 
the drug development process, because you are getting data on 
human systems.” 


Alan Dove is a science writer and editor based in Massachusetts. 
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Cell Culture Flow Systems 

Quasi Vivo Systems are a series of 
advanced, interconnected cell culture 
flow systems, specifically designed to 
improve cellular phenotypes and human 
in vivo relevance of in vitro cell culture 
models. By introducing flow to the in vitro 
environment, Quasi Vivo enables you to 
generate models with improved predictive 
value, greatly improving confidence in 

the validity of your results. The system 
consists of interconnected cell culture 
chambers and a peristaltic pump to create 
a continuous flow of media over cells. It is available with three different 
culture chambers (QV500, QV600, and QV900) to support a wide range of 
applications, including submerged cell culture, coculture, and modeling of 
air-liquid and liquid-liquid interfaces. Not only is the system easy to set 
up, it also enables close monitoring of variables during an experiment. 
Furthermore, its large scale and user configurability allow researchers to 
perform assays that are not possible with microfluidic systems. 

Lonza 

For info: 800-638-8174 

www.lonza.com/quasi-vivo 


Micropatterned Hepatocyte Co-Cultures 

HepatoPac Kits are in vitro tools used for predicting likely in vivo 
outcomes and mechanisms of toxicity with respect to transport and 
metabolism of drugs and chemicals. HepatoPac plates contain mi- 
cropatterned co-cultures (MPCCs) of primary hepatocytes and stromal 
cells. This technology replicates the physiological microenvironment of 
the liver and allows hepatocytes to exhibit normal metabolic activity 
for over 4 weeks for both short- and long-term toxicology and efficacy 
studies during preclinical drug discovery. Hepatocyte health, function- 
ality, and liver enzyme activity can be extensively characterized on the 
HepatoPac platform. HepatoPac products are available with human, 
rat, dog, or monkey hepatocytes, or in multispecies formats. 
Ascendance Biotechnology 

For info: 781-391-0205 

ascendancebio.com 


Disease States Cell Culture 

HemaCare supplies researchers with a wide variety of well-characterized 
human disease-state primary cells and blood components through our 
U.S.-based collection network for use in basic, clinical, and translational 
research studies. The company can provide whole blood, plasma, serum, 
and peripheral blood mononuclear cells for key research areas from 
diseases such as acute myeloid leukemia, Crohn's disease, and syphilis. 
In addition to the products listed in the catalog, HemaCare can offer 
fresh and cryopreserved primary cells and blood components from 
other disease states—including various cancers such as renal cancer and 
multiple myeloma, allergies, and many other conditions. 

HemaCare 

For info: 877-310-0717 
www.hemacare.com/store/products/disease-state.html 
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Single-Cell Analysis System 

The Cyto-Mine Single-Cell Analysis System automates single-cell 
analysis, sorting, imaging, and dispensing. It is a single, compact, 
GLP-compliant system that fits in a standard biological safety cabinet. 
Traditionally, up to three different instruments would be required 

for each step, resulting in a costly, time-consuming process, using up 
valuable lab space, and increasing the risk of sample contamination. 
Cyto-Mine can deliver up to 10 million tests per day, far more than 
other systems on the market that deliver 10,000 tests in three 

weeks. The high-throughput system uses Sphere Fluidics’ patented 
picodroplet technology to encapsulate a single cell in growth media 
and trap secreted molecules, such as antibodies, from the cell as 

it grows. Since each cell is compartmentalized in the disposable 
Cyto-Cartridge, monoclonality is assured, and the unique workflow 
enables selective screening of single cells to find rare lead candidates. 
Sphere Fluidics 

For info: 888-258-0226 
www.spherefluidics.com/products/integrated-systems/cyto-mine 


Automated Pipetting Robot 

The BRAND Liquid Handling Station (LHS) pipetting robot handles 
routine tasks at high speed and with the highest precision. It closes 
the gap between electronic pipettes and complex, expensive 
pipetting robots. The LHS is a quiet, compact instrument with a 
unique front door, allowing it to be set up in tight quarters. The 
instrument has seven working positions and five available liquid 
ends with single-channel volumes up to 1,000 UL and multichannel 
volumes up to 300 UL. The system has the flexibility to move liquids 
between reservoirs, tubes, and plates. Applications include PCR, 
quantitative PCR, and ELISA setup; serial dilutions; microplate 
replication and reformatting; DNA normalization; cherry picking; 
cell cultures; and general liquid transfers. Adapters and racks bring 
consumables to an even height, reducing unnecessary vertical 
movements to save time during pipetting. The LHS is supplied with 
extremely user-friendly software—liquid handling protocols are 
created with ease. 

BrandTech Scientific 

For info: 888-522-2726 
www.brandtech.com/product/liquid-handling-station 


Syringe Pump 

Hamilton Company expands its PSD/4 precision syringe drive 
product family with the launch of PSD/4 Smooth Flow. The Smooth 
Flow delivers exact flow rates in nanoliters per minute with total 
dispense times up to 8 hours. It is perfect for engineers seeking an 
economical way to automate processes that were developed on lab- 
scale pumps, and is also ideal for flow cytometry and microfluidics 
applications where smaller sample volumes, faster analysis time, 
and more sensitive detection mechanisms require accurate and 
extremely small precision flow rates. The compact PSD/4 design now 
has 192,000 steps of resolution across the 30-mm stroke. The pump 
is compatible with a variety of catalog and custom valves as well as 
standard and long-life syringes from 12.5 uL to 12.5 mL. 

Hamilton Company 

For info: 800-648-5950 

www.hamiltoncompany.com 
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The Wenzhou FGF Team Seeks 
Top Talent To Discover Opportunities: 


Zoom in on the therapeutic applications of FGFs in China 


Based at both Wenzhou Medical University 
and Wenzhou University the research team led 
by Prof. Xiaokun Li is devoted to FGF basic 
and translational research with their clinical 
application as an ultimate goal. The therapeu- 
tic modalities using FGF formulations have 
evolved from the initial external and topical 
administration to implantable medical device 
as well as injection. The clinical indications 
have expanded from trauma and diabetic ulcer 
treatment, to neurological repair/regeneration. 
In the immediate future, FGFs will also be in- 
strumental in the treatment of major metabolic 
diseases, such as diabetes and atherosclerosis. 

The fibroblast growth factor (FGF) family 
is comprised of a group of structurally related 
protein ligands that signal through FGF 
receptor tyrosine kinases (FGFRs) to carry out 
a plethora of vital functions in development, 
metabolism, tissue homeostasis and repair 
after injury. Historically, translational research 
of FGF/FGEFR has focused on the oncology 
aspect in the West, whereas the research 
team led by Prof. Xiaokun Li based at both 
Wenzhou Medical University and Wenzhou 
University is devoted to FGF biotechnology 
and the delivery of engineered clinical grade 
FGFs for tissue repair and regeneration. 

The FGF family consists of 18 mammalian 
FGFs divided into 6 subgroups on the bases 
of their sequence homology, phylogenetics 


and structural characteristics. Five of the 
subgroups are considered canonical FGFs 
capable of high affinity binding to heparin 
sulfate (HS) and acting locally as paracrine 
molecules. The potential of FGFs in promot- 
ing cell proliferation, survival, angiogenesis, 
migration, and differentiation, have been 
explored for therapeutic applications in the 
setting of tissue repair/regeneration and also 
cancer therapy. Li’ s team, which started 
from Jinan University, Guangzhou, overcame 
several major bottlenecks for FGF protein 
engineering and recombinant production 
such as poor protein solubility and stability, 
enabling them to develop and license the first 
FGF protein drug (FGF2) in the world for 
clinical use. The clinical trial of the topical 
FGEF2 biologics, led by Prof. Xiaobing Fu and 
first reported in the Lancet, proved its benefi- 
cial effect in accelerating healing of burn 
wound, skin flap transplantation, and diabetic 
ulcers. As of June 2017, in China alone, FGF 
biologics have been used in 80 million cumu- 
lative patients/cases with great clinical and 
socioeconomic benefit. Importantly, the safety 
record of this treatment has been excelling; 
during a 20-year period post-FGF treatment of 
clinical follow-up, no excessive hyperplasia 
or other major adverse effects have been ob- 
served. The FGF biologics have transformed 
the clinical practice of trauma management 


from the traditional anti-infection and anti-in- 
flammation therapies to include FGF-induced 
coordinated pro-active repair and functional 
regeneration. In the past 10 years, Li and his 
colleagues have also successfully developed 
novel formulations for FGF1 and FGF2, as 
well as FGF7 and FGF10 biologics, which are 
currently in different phases of clinical evalu- 
ation. A non-mitogenic mutant FGF1 formula 
is also under preclinical evaluation for the 
treatment of Type 2 Diabetes. 

The endocrine FGFs are relatively new 
members of the FGF family with much 
reduced affinity toward HS and activate 
FGFRs with Klotho as a cofactor and exhibit 
distinct regulatory activity in various metabol- 
ic processes including glucose, lipid, bile acid, 
vitamin D and phosphate metabolism and 
energy homeostasis. These atypical FGFs, as 
exemplified by FGF21, present therapeutic 
potentials for a myriad of major metabolic 
diseases such as diabetes, obesity, cardiovas- 
cular and renal diseases, amongst others. 
Besides the aforementioned paracrine FGFs, 
Li’ s team has also undertaken major effort on 
basic and translational research on endocrine 
FGFs, particularly FGF21 and more recently 
FGF23. Li and his colleagues first discovered 
the role of adiponectin in mediating the meta- 
bolic effect of FGF21 on energy metabolism 
and insulin sensitivity, as well as protection 
against atherosclerosis. Currently the team 
has completed preclinical studies and is in the 
process of applying for CFDA approval for 
clinical trial on FGF21 for the treatment of 
diabetes. In collaboration with Prof. Moham- 
madi from New York University, Li’ s team 
provides structural insight into the activation 
of Phospholipase C by the concerted action 
of two FGF receptor molecules. A recent 
Nature publication further solved the atomic 
structure of the a-Klotho/FGFRI1c/FGF23 
ternary complex providing a basis for under- 
standing how klothos and endocrine FGFs 
interact, therefore opening up new avenues 
for structure-based drug design. Studies from 
Li’ s team have already unveiled unexpected 
therapeutic activity of FGF1 toward diabetic 
nephropathy and demonstrated that mitogenic 
and metabolic activities can be uncoupled by 
tuning FGF1/FGFR dimer stability. These 
mechanistic findings will likely lead to future 
drug discoveries targeting FGFs for the treat- 
ment of a variety of human diseases. 

Armed with the rich and validated experi- 
ence of “Chinese Style” protein drug develop- 
ment, the Wenzhou FGF team and its network 
of international collaborators, which together 
constitute the “Wenzhou FGF family” is push- 
ing ahead to make significant contributions 
to translate FGF research discoveries into 
clinical application in the years to come. 


The Wenzhou FGF team welcomes scholars from around the world to join them. To learn more about available opportunities, interested candidates 
should email (profxiaokunli@163.com or Zhang.jinsan@mayo.edu) the Wenzhou FGF team. 
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PRACTICAL SOLUTIONS FOR VIEWING AND PHOTOGRAPHING FLUORESCENCE 
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MORE INFORMATION... 
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Fluorescence has diverse applications in science, education, art, forensic Microscopy 


sciences, industry and more. NIGHTSEA provides complete solutions — light Sciences 
sources and matched filters — for viewing and imaging fluorescence. With both P.0. Box 550 * 1560 Industry Rd. 
off-the-shelf and customized equipment Fluorescence Viewing Systems have Hatfield, Pa 19440 


helped thousands of customers around the world enter this hidden spectral realm. | Tek (215) 412-8400 
Fax: (215) 412-8450 
email: sgkcck@aol.com 
or stacile@ems-secure.com 
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Step up your job search 
with Science Careers 


e Access thousands of job postings 
e Sign up for job alerts 
e Explore career development tools and resources 


Q Search jobs on ScienceCareers.org today _ 
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College 
LONDON 


Professor of Analytical & Forensic Sciences 


School of Population Health & Environmental Sciences, 
Department of Analytical, Environmental & Forensic Sciences 


We are seeking a resilient and inspirational Director able to work strategically 
within King’s Forensics. This full time permanent post is based within the Faculty 
of Life Sciences & Medicine, School of Population Health & Environmental 
Sciences and the Department of Analytical, Environmental and Forensic 
Sciences located at the King’s Waterloo Campus. It offers an exciting 
opportunity to establish and lead research in the field of analytical science within 
an internationally recognised and multi disciplinary university environment. 


It is anticipated that you will have research expertise in an area of analytical 
science that both complements and broadens our current research interests 
of toxicology, drug misuse in sport, forensic genetics, and environmental 
forensics including evidence detection. Good knowledge, preferably with 
practical experience, of operational forensic science is desirable. Applicants 
should have a track record of internationally excellent high impact research, 
demonstrated by a record of quality publications and proven ability to attract 
external funding. Experience of supervising postgraduate research students 
is essential. Applicants should have experience of developing and delivering 
innovative graduate and/or professional education. The salary for this post 
will be in accordance with the level of appointment. 


The appointment will be made, dependent on relevant experience and settled 
by negotiation. The minimum professorial salary at King’s College, London is 
£66,084 per annum, plus £2,923 London Weighting Allowance per annum. 


Further details and application packs are available on the College’s website at 
www.kcl.ac.uk/jobs. All correspondence should clearly state the job title and 
reference number A9/JVA/0474/18-SS. If you have any queries please contact 
the recruitment team at recruitmentteam2@kcl.ac.uk. 


The deadline for applications is midnight on 3rd April 2018. 
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=2) THE UNIVERSITY OF HONG KONG 


Founded in 1911, the University of Hong Kong is committed to 
the highest international standards of excellence in teaching and 
research, and has been at the international forefront of academic 
scholarship for many years. The University has a comprehensive 
range of study programmes and research disciplines spread 
across 10 faculties and over 140 academic departments 
and institutes/centres. There are 28,000 undergraduate and 
postgraduate students who are recruited globally, and more 
than 2,000 members of academic and academic-related staff 
coming from multi-cultural backgrounds, many of whom are 
internationally renowned. 


Post-doctoral Fellowships 


Applications are invited for a number of positions as Post-doctoral 
Fellow (PDF), at the University of Hong Kong. Appointments will 
be made for a period of 2 to 3 years and the appointees must 
be in post on or before February 28, 2019. 


PDF posts are created specifically to bring new impetus and 
vigour to the University’s research enterprise. Positions are 
available from time to time to meet the strategic research 
needs identified by the University. Positions are available in the 
following Faculties/Departments/Schools/Centres/Units: 


Faculty of Dentistry e¢ Department of Pathology 
Faculty of Education e Department of 
Department of Electrical Pharmacology and 

and Electronic Pharmacy 

Engineering School of Public Health 
Department of Mechanical Department of Surgery 

Engineering School of Biological 
School of Biomedical Sciences 

Sciences Department of Chemistry 
Department of Clinical Department of Physics 

Oncology Department of Statistics 
Department of and Actuarial Science 

Microbiology Department of Psychology 
Department of Paediatrics Partner State Key 

and Adolescent Medicine Laboratory for Liver 


Research 


Post-doctoral Fellows 

PDFs are expected to devote full-time to research. Applicants 
should be doctoral degree holders having undertaken original 
research that has contributed to the body of knowledge. A 
highly competitive salary commensurate with qualifications and 
experience will be offered. Annual leave and medical benefits 
will also be available. 


Procedures 

Prospective applicants are invited to visit our webpage at 
http://jobs.hku.hk/ to view the list of the Faculties/Departments/ 
Schools/Centres/Units and their research areas for which PDF 
positions are currently available. Before preparing an application, 
they should contact the Head of the appropriate academic unit, or 
the contact person as specified, to ascertain that their research 
expertise matches the research area for which a vacant PDF 
post is available. 


Applicants must submit a completed University application form, 
which should clearly state which position they are applying for; 
and in which academic discipline. They should also provide 
further information such as details of their research experience, 
publications, research proposals, etc. 


Application forms (341/1111) can be downloaded at 
http://www.hr.hku.hk/apptunit/form-ext.doc and further 
particulars can be obtained at http://jobs.hku.hk/. Closes April 
22, 2018. The University thanks applicants for their interest, but 
advises that only candidates shortlisted for interviews will be 
notified of the application result. 


The University is an equal opportunities employer and is committed to equality, ethics, 
inclusivity, diversity and transparency; and is committed to a Non-smoking Policy 
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By Meghan Wright 
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Instagram wont solve inequality 


cience Sam is a big name on campus. She’s a Ph.D. candidate in the sciences who wants to pur- 
sue a career outside of academia, like me. But unlike me, she is our school’s science communica- 
tion, or #scicomm, superstar. Her Instagram page, which aims to show the “fun and trendy” side 
of science, was recently celebrated in the school’s newsletter for increasing the public’s trust in 
scientists. At a career workshop, graduate students were urged to follow Science Sam’s example 
and use #scicomm to build our personal brands as we enter the job market. I already have 
an Instagram account, but it reflects my interests in photography and baking more than my love of 
science. The workshop got me thinking: Should my posts focus less on pastries and more on pipettes? 


I followed Sam on Instagram, and 
I soon found many other female 
graduate students and postdocs 
whose Instagram pages are filled 
with pretty selfies, fun videos, and 
microscope images captioned with 
accessible language and cute emo- 
jis. These researchers assert them- 
selves as scientists who don’t fit 
the stereotypes that are typically 
applied to women in the field. 
They are not boring or unfashion- 
able. Instead, their posts demon- 
strate that they're interested in 
clothes and makeup, that they’re 
physically active, and that they are 
attractive romantic partners. By 
visibly contradicting stereotypes 
about female scientists, it is clear 
that they hope to inspire girls to 
pursue science and to encour- 
age female scientists to showcase 
their femininity in our male-dominated work spaces. 

If I had seen these posts when I was 17, maybe I would 
have been more confident that I would excel in engineer- 
ing even though I spent my time in the darkroom develop- 
ing photos, not in the computer lab learning to code like 
the male engineers I saw in popular culture. But instead 
of cheering on Instagram’s dynamic and vibrant #scicomm 
women, I felt an increasing bitterness with each post I came 
across, though I struggled to understand why. 

After long conversations with other women in my lab, 
I realized that I am not bitter toward the authors of these 
posts. Although I am annoyed that the majority of the 
posts seem to celebrate a very narrow representation of 
femininity, my real bitterness comes from the systemic 
challenges that these posts are working to address, and 
from seeing so many young female scientists compelled to 
turn to their personal social media pages to try to correct 
the system’s failures. 


“Publicly documenting ... the 
sweet smile I brandish in 
the lab isn’t going to help me.” 


I have come to understand 
#scicomm on Instagram as a digital 
demonstration of the efforts that 
many female scientists exert daily 
to correct for gender disparities. Fe- 
male scientists spend demonstrably 
more time teaching, mentoring, and 
participating in community out- 
reach than their male colleagues, 
just as there are far more female 
#scicommers on Instagram than 
male. I liken the many hours that 
Science Sam spends on her Insta- 
gram content to the volunteer work 
a female professor in my depart- 
ment put into organizing a summer 
program to introduce teens from 
underrepresented communities to 
biomedical engineering. 

A lot of good comes from the 
work of female scientists on Insta- 
gram. But it disturbs me that these 
efforts are celebrated as ways to correct for the long held 
and deeply structured forms of discrimination and exclu- 
sion that female scientists face. I wonder whether our 
efforts should instead be directed toward advocating for 
policy changes at institutional and governmental levels. 

When I next interview for a job, I won't have an 
Instagram page to show that my love of science doesn’t 
make me boring and unfriendly. Publicly documenting the 
cute outfit I wear and the sweet smile I brandish in the 
lab isn’t going to help me build a fulfilling career in a field 
where women hold less senior positions, are paid less, and 
are continuously underrated. Time spent on Instagram is 
time away from research, and this affects women in science 
more than men. That’s unfair. Let’s not celebrate that. 


Meghan Wright is a doctoral candidate in the Institute 
of Biomaterials and Biomedical Engineering at the 
University of Toronto in Canada. 
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